AMERICAN 
JOURNAL of PHYSICS 


Devoted to the Instructional and Cultural Aspects of Physical Science 


VOLUME 22 NUMBER 5


MAY 1954 




Published nine times per year for the 
AMERICAN ASSOCIATION OF PHYSICS TEACHERS 
by the 
AMERICAN INSTITUTE OF PHYSICS 
Incorporated 


Prince & Lemon Sts., Lancaster, Pa., or 57 East 55 Street, New York 22, N. Y.


June Meeting of the American Association of Physics Teachers,
University of Minnesota, Minneapolis, Minnesota (see p. 276).


















AMERICAN ASSOCIATION OF PHYSICS TEACHERS 


OFFICERS 
Marsh W. White, President


Pennsylvania State University 
State College, Pennsylvania 


R. RONALD PALMER, President-Elect
Beloit College 
Beloit, Wisconsin 


R. F. Paton, Secretary 


University of Illinois 
Urbana, Illinois 


Francis W. Sears, Treasurer 
Massachusetts Institute of Technology
Cambridge, Massachusetts


EXECUTIVE COMMITTEE 


V. L. BOLLMAN
V. E. EATON
P. E. KLOPSTEG
W. C. MICHELS
T. H. OSGOOD
W. S. WEBB 


The officers and editor are also 
members of the executive committee. 


COUNCIL 


R. T. ELLICKSON
W. GEER
S. O. GRIMM
J. J. HEILEMANN
G. Q. LEFLER
R. R. MEIJER
W. NOLL
R. R. PALMER
T. D. PHILLIPS
R. L. PRICE
F. VERBRUGGE
I. WALERSTEIN
L. R. WEBER
C. WILLIAMSON


The members of the executive committee
are also members of the
council.


Entered as second-class matter April 5, 1948, at the post office at Lancaster, Pa., under the Act of August 24, 1912. 


AMERICAN JOURNAL OF PHYSICS 


Editor 
THOMAS H. OSGOOD, Michigan State College
Assistant Editor 
B. H. Dickinson, Michigan State College 
Associate Editors
RICHARD A. BETH, Western Reserve University
DAVID L. FALKOFF, Brandeis University and Massachusetts
Institute of Technology
IRA M. FREEMAN, Rutgers University
A. C. HELMHOLZ, University of California, Berkeley
NORA M. MOHLER, Smith College
J. D. STRANATHAN, University of Kansas
C. N. WALL, University of Minnesota
HUGH C. WOLFE, Cooper Union
ROBERT W. YOUNG, U. S. Navy Electronics Laboratory,
San Diego





THE AMERICAN JOURNAL OF PHYSICS is devoted to the instructional and
cultural aspects of physical science. Previous to 1940 this publication
was known as The American Physics Teacher. It is published monthly, except
during June, July and August, by the American Institute of Physics for
the American Association of Physics Teachers at Prince and Lemon Streets,
Lancaster, Pennsylvania.





Manuscripts for publication should be submitted to Dr. Thomas H. Osgood, 
American Journal of Physics, School of Graduate Studies, Michigan State 
College, East Lansing, Michigan. The authors’ institutions are requested to 
pay an optional publication charge of $4 per page which, when honored,
entitles them to 100 free reprints. Instructions will be sent with galley proofs.

Proof and correspondence concerning papers in the process of being printed 
should be addressed to the American Institute of Physics, 57 East 55 Street, 
New York 22, New York. 


Subscription Price 
United States and Canada: $6.00
Elsewhere: $7.00
Back Numbers 


Complete set: Vol. 1, 1933-Vol. 21, 1953—$147.00 
Yearly back number rate: $8.00 
Single copies: $2.00 each 1933-1936; $1.50 each thereafter. 
Cumulative index (Vols. 1-20, 1933-1952): $3.00 
Subscriptions, renewals, and orders for back numbers should be addressed 
to the American Institute of Physics, 57 East 55 Street, New York 22, New 
York. Members and junior members of the American Association of Physics 
Teachers receive the American Journal of Physics. 
Changes of address should be addressed to the American Institute of 
Physics. 
The contents of the American Journal of Physics can be found indexed in 
the Education Index. 










Reproductions of Prints, Drawings, and Paintings of Interest 
in the History of Physics 


61. Caricatures of Sir Isaac Newton by Two Famous Artists 


E. C. Watson 
California Institute of Technology, Pasadena, California 


(Received August 17, 1953) 


Caricatures of famous scientists, particularly when done by competent artists, often have 
historical as well as entertainment value. Two amusing caricatures of Isaac Newton, one by 
John Leech, the other by George Cruikshank, are here reproduced primarily for the reader's
entertainment.


As D. MCKIE and G. R. DE BEER wrote recently,
“the story of Newton and the apple
was quick to strike the imagination of philosophers
and has long been cherished as one of the
most delightful as well as important anecdotes
in the history of science.” Unfortunately, anecdotes
of this kind are all too often apocryphal
and it was uncertain until recently whether or
not the story of the apple was in this category.¹
However, it now appears that it can be accepted
at face value. This conclusion is based primarily
upon certain memoirs by WILLIAM STUKELEY
which were unpublished until recently.²

A. HASTINGS WHITE, the editor of these
memoirs, states that “WILLIAM STUKELEY,
physician, divine, and one of the best known 
antiquaries of the eighteenth century, was, in 
his own phrase, ‘a countryman of Sir Isaac 


1 The evidence for and against the validity of the story 
has been analyzed and presented by D. McKie and 
G. R. de Beer in Notes and Records Roy. Soc. 9, 46-54,
333-335 (1951-52).

2 Memoirs of Sir Isaac Newton's Life by William Stukeley, 
M.D., F.R.S. 1752. Being Some Account of his Family and 
Chiefly of the Junior Part of his Life, edited by A. Hastings 
White (Taylor and Francis, London, 1936). 


Newton’s.’ Although he was forty-five years 
Newton’s junior, they came together in the 
Royal Society about the year 1718, and were on 
terms of familiar friendship during the closing 
years of Newton’s life. In 1726 Stukeley removed 
from London to take up his residence at Gran- 
tham, and there he made it his business to collect 
reminiscences of the early life of Newton.” In 
these reminiscences STUKELEY writes as follows:³

“On 15 April 1726 I paid a visit to Sir Isaac at 
his lodgings in Orbels buildings in Kensington, 
dined with him and spent the whole day with 
him, alone.... After dinner, the weather being 
warm, we went into the garden and drank thea, 
under the shade of some appletrees, only he and 
myself. Amidst other discourse, he told me, he 
was just in the same situation, as when formerly, 
the notion of gravitation came into his mind. It 
was occasion’d by the fall of an apple, as he sat 
in a contemplative mood. Why should that 
apple always descend perpendicularly to the 
ground, thought he to himself. Why should it 
not go sideways or upwards, but constantly to 


3 Reference 2, pp. 19-20. 




the earths centre? Assuredly, the reason is, that 
the earth draws it. There must be a drawing 
power in matter: and the sum of the drawing 
power in the matter of the earth must be in the 
earths center, not in any side of the earth. There- 
fore dos this apple fall perpendicularly, or to- 
wards the center. If matter thus draws matter, 
it must be in proportion of its quantity. There- 
fore the apple draws the earth, as well as the 
earth draws the apple. That there is a power, like 
that we here call gravity, which extends its 
self thro’ the universe. 

“And thus by degrees he began to apply this 
property of gravitation to the motion of the 
earth and of the heavenly bodys, to consider 
their distances, their magnitudes and their 
periodical revolutions; to find out, that this 
property conjointly with a progressive motion 
impressed on them at the beginning, perfectly 
solv’d their circular courses; kept the planets 
from falling upon one another, or dropping all 
together into one center; and thus he unfolded 
the Universe. This was the birth of those amazing 
discoverys, whereby he built philosophy on a 
solid foundation, to the astonishment of all 
Europe.” 

FIG. 1. Discovery of the laws of gravitation
by Isaac Newton.

The scene here described is amusingly portrayed
(Fig. 1) by JOHN LEECH in one of his
woodcut illustrations for The Comic History of
England, by Gilbert Abbott à Beckett (London,
1847, 1848). JOHN LEECH (1817-1864), who is
perhaps best known for his woodcuts in Punch 
extending over a period of more than twenty 
years, was the first English caricaturist whose 
work was clean and wholesome throughout. He 
was a true humorist who relied not so much upon 
exaggeration and grotesqueness as upon the 
whimsical aspects of human life. There is nothing 
in STUKELEY’s account to indicate that the apple
actually struck NEWTON on the head as shown in
LEECH’s woodcut. This refinement seems to have
been introduced by D’Israeli, for AUGUSTUS
DE MORGAN says in his Budget of Paradoxes
(London, 1872), p. 81, that “D’Israeli has got 
an improvement on the story: the apple ‘struck 
him a smart blow on the head:’ no doubt taking 
him just on the organ of causality. He was 
‘surprised at the force of the stroke’ from so 
small an apple: but then the apple had a mission; 
Homer would have said it was Minerva in the 
form of an apple. ‘This led him to consider the 
accelerating motion of falling bodies,’ which 
Galileo had settled long before: ‘from whence he 
deduced the principle of gravity,’ which many 
had considered before him, but no one had 
deduced anything from it. I cannot imagine 
whence D’Israeli got the rap on the head, I mean
got it for Newton: this is very unlike his usual 
accounts of things. The story is pleasant and 
possible: its only defect is that various writings, 
well known to Newton, a very learned mathe- 
matician, had given more suggestion than a 
whole sack of apples could have done, if they had 
tumbled on that mighty head all at once.” 

STUKELEY, in his Memoirs, also comments
both upon NEWTON’s absent-mindedness and
upon what seems to have been his only love
affair. Regarding the latter he writes as follows:⁴

“Mrs. Vincent is a widow gentlewoman living 
now at Grantham (1727) aged 82. Her maiden 
name was Storey.... Her mother, who was a 
handsom woman, was second wife to Mr. Clark, 
the apothecary, where Sir Isaac lodg’d, so that 
she lived in the same house with him all the 
time of his being at Grantham school, which 
was 7 years. Further, her mother and Sir Isaacs 
mother were intimately acquainted, which was 
one reason of his lodging at Mr. Clarks.... 


4 Reference 2, pp. 45-46. 







“Sir Isaac and she being thus brought up to- 
gether it is said that Sir Isaac entertaind a pas- 
sion for her when they grew up; nor dos she deny 
it. ‘Tis certain he always had a great kindness 
for her. He visited her whenever in the country, 
in both her husbands days, and gave her, at a 
time when it was useful to her, a sum of money. 
She is a woman but of a middle stature, of a 
brisk eye, and without difficulty we may discern 
she has been very handsom.”

Regarding NEWTON’s absent-mindedness, STUKELEY
gives several accounts, of which the
following is typical:⁵

“At Cambridge I often heard storys of his 
absence of mind, from common things of life. 
As when he has been in the hall at dinner, he has 
quite neglected to help himself, and the cloth 
has been taken away before he has eaten any- 
thing. That sometime, when on surplice days, he 
would goe toward S. Mary’s church, instead of 
college chapel, or perhaps has gone in his surplice 
to dinner in the hall. That when he had friends to 
entertain at his chamber, if he stept in to his 
study for a bottle of wine, and a thought came 
into his head, he would sit down to paper and 
forget his friends. Thus the human mind, wholly 
taken upon abstract reasonings, and long con- 
catenation of causes and consequences, was apt, 
as it were, to desert the body, assume its essential 
and true life and enjoy those superlative pleas- 
ures arising from contemplations of the most 
worthy sort, nearly approaching to angelical. Tis 
an anticipation of part of those divine joys in our 
future state of being.” 

5 Reference 2, p. 61.

FIG. 2. Sir Isaac Newton’s courtship.

This characteristic of NEWTON has been seized
upon by LEECH’s great contemporary, GEORGE
CRUIKSHANK (1792-1878), in his ludicrous caricature
entitled “Sir Isaac Newton’s Courtship”
(Fig. 2), in which NEWTON is represented as 
absent-mindedly using the lady’s little finger 
to tamp down the tobacco in his pipe, much to 
her consternation. The head of NEWTON was 
apparently copied from KNELLER’s well-known 
portrait, so it at least is not a figment of the 
artist’s imagination. The caricature as a whole 
is a good example of CRUIKSHANK’S technical 
skill as an etcher which was such that he must 
be ranked among the great masters of that art as 
well as of caricature. 

Unfortunately, both of these caricatures give 
the impression that NEWTON was a confirmed 
pipe smoker. I have been unable, however, to 
find any evidence that he ever smoked. 


Symposium on Molecular Structure and Spectroscopy 


The Symposium on Molecular Structure and Spec- 
troscopy will be held at the Department of Physics and 
Astronomy, The Ohio State University, from June 14 to 
June 18, 1954. There will be discussions of the interpreta- 
tion of molecular spectroscopic data as well as methods 
for obtaining such data. In addition, there will be sessions 


devoted to those phases of spectroscopy of current interest. 
A dormitory will be available for those who wish to reside 
on the campus during the meeting. For further information, 
or for a copy of the program, write to PROFESSOR H. H.
NIELSEN, Department of Physics and Astronomy, The 
Ohio State University, Columbus 10, Ohio. 





The Effective Mass of Electrons in Crystals 


C. KITTEL 
University of California, Berkeley, California 
(Received October 30, 1953) 


The transfer of momentum between a conduction electron and a crystal lattice is examined in
detail. By physical considerations we are led to the expression

$$\left(\frac{m}{m^*}\right)_{ij} = \delta_{ij} + \sum_{\mathbf{G}} G_i \frac{\partial}{\partial k_j}\,|a_{\mathbf{G}}(\mathbf{k})|^2$$

for the reciprocal effective mass tensor, where $\mathbf{G}$ is $2\pi$ times a reciprocal lattice vector and $a_{\mathbf{G}}(\mathbf{k})$
is a coefficient in the momentum representation of the Bloch eigenfunction:

$$\psi_{\mathbf{k}} = \sum_{\mathbf{G}} a_{\mathbf{G}}(\mathbf{k}) \exp[i(\mathbf{G}+\mathbf{k})\cdot\mathbf{r}].$$

This expression for the effective mass tensor is shown to be completely equivalent to the usual
definition.


THE effective mass of a conduction electron
in a crystal is an important part of the
conceptual structure of solid-state physics. Reference
will be found in every textbook on the
quantum theory of solids to the theorem that if
$W_{\mathbf{k}}$ is the energy of a crystal eigenstate $\psi_{\mathbf{k}}$, then
a wave packet centered about the wave vector $\mathbf{k}$
will move under applied electric and magnetic
fields as if endowed with the effective mass $m^*$,
where

$$\left(\frac{m}{m^*}\right)_{ij} = \frac{m}{\hbar^2}\,\frac{\partial^2 W_{\mathbf{k}}}{\partial k_i\,\partial k_j}. \tag{1}$$

A well-known consequence of Eq. (1) is that in a
periodic lattice an electron moves as if endowed
with an effective mass different from the free
electron mass $m$, without conflicting with the
classical acceleration equation. The apparent
conflict between the use of $m$ and $m^*$ is resolved,
as discussed qualitatively by several authors,¹
on taking into account the interaction of the
electron and the crystal lattice. The purpose of
this note is to exhibit the mechanism of the momentum
transfer in an explicit and quantitative
way. We may note that the effective masses of
charge carriers in germanium crystals have been
measured directly by cyclotron resonance.¹ᵃ

We consider the Bloch eigenfunction $\psi_{\mathbf{k}}$ belonging
to the energy eigenvalue $W_{\mathbf{k}}$ and wave
vector $\mathbf{k}$. We may analyze $\psi_{\mathbf{k}}$ in a Fourier series,


1 See, for example, J. C. Slater, Revs. Modern Phys. 6, 
209 (1934), pp. 260-261. 


1a Dresselhaus, Kip, and Kittel, Phys. Rev. 92, 827 
(1953). 


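thereby obtaining the expansion of $\psi_{\mathbf{k}}$ in the
plane wave or momentum representation:

$$\psi_{\mathbf{k}} = \sum_{\mathbf{G}} a_{\mathbf{G}}(\mathbf{k}) \exp[i(\mathbf{G}+\mathbf{k})\cdot\mathbf{r}]. \tag{2}$$

Here $\mathbf{G}$ is $2\pi$ times a vector in the reciprocal
lattice. The momentum of an electron in the state
$\mathbf{k}$ is

$$\mathbf{p}_{el} = (\mathbf{k}|-i\hbar\nabla|\mathbf{k}) = \sum_{\mathbf{G}} \hbar(\mathbf{G}+\mathbf{k})\,|a_{\mathbf{G}}(\mathbf{k})|^2 = \hbar\Big[\mathbf{k} + \sum_{\mathbf{G}} \mathbf{G}\,|a_{\mathbf{G}}(\mathbf{k})|^2\Big], \tag{3}$$

taking $\psi_{\mathbf{k}}$ normalized in unit volume. We now
examine the transfer of momentum between the
electron and the lattice when the state $\mathbf{k}$ of the
electron is changed to $\mathbf{k}+\Delta\mathbf{k}$.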

We imagine an insulating crystal electrostat- 
ically neutral except for a single electron in the 
state k of the otherwise empty conduction band 
of the crystal. We suppose that a weak external 
electric field is applied for a long time interval, 
such that the total impulse given to the entire 
crystal system is 


$$\mathbf{J} = \int e\mathbf{E}\,dt. \tag{4}$$


The interval is taken as long in order to make 
possible in principle the approximate definition 
of the final state of the system. 

If the conduction electron were free ($m^* = m$),
the total momentum imparted to the crystal
system by the impulse would appear in the
change of momentum of the conduction electron:

$$\mathbf{J} = \Delta\mathbf{p}_{tot} = \Delta\mathbf{p}_{el} = \hbar\,\Delta\mathbf{k}. \tag{5}$$

The neutral crystal suffers no net interaction
with the electric field, either directly or indirectly
through the conduction electron.

If the conduction electron interacts with the
periodic potential of the crystal lattice, we must
have

$$\mathbf{J} = \Delta\mathbf{p}_{tot} = \Delta\mathbf{p}_{lat} + \Delta\mathbf{p}_{el}. \tag{6}$$

From Eq. (3) we have

$$\Delta\mathbf{p}_{el} = \hbar\,\Delta\mathbf{k} + \sum_{\mathbf{G}} \hbar\mathbf{G}\,\big[(\nabla_{\mathbf{k}}|a_{\mathbf{G}}(\mathbf{k})|^2)\cdot\Delta\mathbf{k}\big]. \tag{7}$$


The change $\Delta\mathbf{p}_{lat}$ in the lattice momentum
resulting from the change of state of the electron
may be derived by an elementary physical consideration.
We are familiar with the circumstance
that an x-ray photon suffering Bragg reflection
from a crystal lattice transfers momentum
to the lattice. Similarly, an electron reflected
by the lattice transfers momentum to the lattice.
If an incident electron with momentum $\hbar\mathbf{k}$ is
reflected with momentum $\hbar(\mathbf{k}+\mathbf{G})$, the lattice
acquires the momentum $-\hbar\mathbf{G}$ as required by
momentum conservation. The momentum transfer
when the state $\psi_{\mathbf{k}}$ goes over to $\psi_{\mathbf{k}+\Delta\mathbf{k}}$ is now
seen to be

$$\Delta\mathbf{p}_{lat} = -\hbar\sum_{\mathbf{G}} \mathbf{G}\,\big[(\nabla_{\mathbf{k}}|a_{\mathbf{G}}(\mathbf{k})|^2)\cdot\Delta\mathbf{k}\big], \tag{8}$$

as only the portion

$$\nabla_{\mathbf{k}}|a_{\mathbf{G}}(\mathbf{k})|^2\cdot\Delta\mathbf{k}$$

of each individual component of the initial state
is effectively reflected during the state change
$\Delta\mathbf{k}$.


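The total momentum change is therefore

$$\Delta\mathbf{p}_{el} + \Delta\mathbf{p}_{lat} = \mathbf{J} = \hbar\,\Delta\mathbf{k}, \tag{9}$$

exactly as for free electrons, Eq. (5). Thus from
Eq. (9) and the definition of $\mathbf{J}$, Eq. (4), we have

$$d\mathbf{k}/dt = e\mathbf{E}/\hbar. \tag{10}$$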


This is a well-known relation, one which is usually 
derived by quite different methods. 

A natural definition of the effective mass is
that $m^*$ is the mass a free electron would need in
order that the velocity increment under the
applied impulse should be equal to the actual
velocity increment of the conduction electron
under the same impulse:

$$\Delta\mathbf{p}_{el}/m = \hbar\,\Delta\mathbf{k}/m^*. \tag{11}$$

The reciprocal effective mass tensor is defined by
this equation and is equal to, using Eq. (7),

$$\left(\frac{m}{m^*}\right)_{ij} = \delta_{ij} + \sum_{\mathbf{G}} G_i \frac{\partial}{\partial k_j}\,|a_{\mathbf{G}}(\mathbf{k})|^2. \tag{12}$$


This is our central result. It does not appear to 
have been given explicitly in the literature 
previously. The derivation has been motivated 
entirely by physical considerations in which the 
momentum exchange with the lattice enters in a 
clear and concrete way, this being the only value 
of the derivation. The tensor is actually sym- 
metric, as shown below. 
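
The step from Eqs. (7) and (11) to Eq. (12) can be written out in components. What follows is only a sketch of that intermediate algebra, using the symbols of the text; the explicit sums over the Cartesian index $j$ and the tensor reading of $1/m^*$ in Eq. (11) are notational assumptions introduced here for illustration.

```latex
\begin{align}
(\Delta p_{el})_i
  &= \hbar\,\Delta k_i
   + \hbar \sum_{\mathbf{G}} G_i \sum_j
     \frac{\partial |a_{\mathbf{G}}(\mathbf{k})|^2}{\partial k_j}\,\Delta k_j
   = \hbar \sum_j \Big[\delta_{ij}
   + \sum_{\mathbf{G}} G_i
     \frac{\partial |a_{\mathbf{G}}(\mathbf{k})|^2}{\partial k_j}\Big]\Delta k_j ,
   \\
\frac{(\Delta p_{el})_i}{m}
  &= \hbar \sum_j \Big(\frac{1}{m^*}\Big)_{ij}\,\Delta k_j
  \quad\Longrightarrow\quad
  \Big(\frac{m}{m^*}\Big)_{ij}
   = \delta_{ij}
   + \sum_{\mathbf{G}} G_i
     \frac{\partial |a_{\mathbf{G}}(\mathbf{k})|^2}{\partial k_j} .
\end{align}
```

The implication in the second line holds because $\Delta\mathbf{k}$ is arbitrary, so the coefficients of each $\Delta k_j$ must agree.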

It is easily possible to prove the complete 
equivalence of the effective mass tensor as given 
in Eq. (12) to the usual definition, Eq. (1), which 
is derived by arguments involving the motion of 
wave packets. We start with an equation given 
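by Slater,² who shows directly that the expectation
value of the velocity of a state $\psi_{\mathbf{k}}$ as calculated
by Eq. (3) above is identical with the
usual expression $\hbar\mathbf{v} = \nabla_{\mathbf{k}} W$ for the group velocity.
Slater’s result is

$$\hbar\mathbf{v} = \nabla_{\mathbf{k}} W = \frac{\hbar^2}{m}\Big[\mathbf{k} + \sum_{\mathbf{G}} \mathbf{G}\,|a_{\mathbf{G}}(\mathbf{k})|^2\Big]. \tag{13}$$

On further differentiation we find

$$\frac{\partial^2 W}{\partial k_i\,\partial k_j} = \frac{\hbar^2}{m}\Big[\delta_{ij} + \sum_{\mathbf{G}} G_i \frac{\partial}{\partial k_j}\,|a_{\mathbf{G}}(\mathbf{k})|^2\Big]. \tag{14}$$

Now from the usual definition of the effective
mass tensor [Eq. (1)] we have

$$\left(\frac{m}{m^*}\right)_{ij} = \frac{m}{\hbar^2}\,\frac{\partial^2 W}{\partial k_i\,\partial k_j} = \delta_{ij} + \sum_{\mathbf{G}} G_i \frac{\partial}{\partial k_j}\,|a_{\mathbf{G}}(\mathbf{k})|^2, \tag{15}$$

in agreement with Eq. (12). This proves the
equivalence of the two definitions. The usual
definition is obviously symmetric in $i$ and $j$, so
that our definition must also be symmetric, in
spite of the apparent unsymmetrical form.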

It is useful to consider the physical implica- 
tions of Eq. (12) with reference to the familiar 


2 Reference 1, Appendix VI. The use of this relation here 
was kindly suggested by J. M. Luttinger. 







weak binding approximation in one dimension
with a perturbation $V = 2V_0\cos Gx$. The band
structure is shown in Fig. 1. At a point A near
the bottom of the lower band the state is represented
quite adequately by a plane wave $e^{ikx}$;
the reflected component is small and increases
only slowly as $k$ is increased, so that in this region
$m^* \approx m$. At B the reflected component $e^{i(k-G)x}$ is
quite large; it is larger still at C and becomes
equal in amplitude to $e^{ikx}$ at D, at which point
(as at E also) the eigenfunctions are standing
waves, rather than running waves. It is thus not
surprising to find in the region B to D negative
values for $m^*$. A negative $m^*$ means only that on
going from state $k$ to state $k+\Delta k$ the momentum
transfer to the lattice is opposite to and larger
than the momentum transfer to the electron.
Although $k$ is increased by $\Delta k$ by the applied
electric field, the consequent Bragg reflections
result in an over-all decrease in the momentum of
the electron, so that the effective mass may be
described as being negative. As we proceed from
E towards F, the amplitude of $e^{i(k-G)x}$ decreases
and $m^*$ assumes a small positive value; that is,
the increase in electron velocity resulting from a
given impulse is larger than that which a free
electron would experience. The lattice makes up
the difference through the recoil it experiences
when the amplitude of $e^{i(k-G)x}$ is diminished.
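
The sign change of $m^*$ described above can be checked numerically. The following is a minimal sketch and not part of the original note: it assumes units with $\hbar = m = 1$, a lattice constant of 1 (so that $G = 2\pi$), an arbitrary illustrative strength $V_0 = 0.5$, and it keeps only the two plane waves $e^{ikx}$ and $e^{i(k-G)x}$ named in the text. All function names are hypothetical. It evaluates $m/m^*$ for the lower band in two ways, from Eq. (12) and from Eq. (1), using finite differences.

```python
import math

# Assumptions for illustration only: hbar = m = 1, lattice constant 1, V0 arbitrary.
G = 2.0 * math.pi
V0 = 0.5


def lower_band(k):
    """Two-plane-wave model: return (W, weight of the reflected wave e^{i(k-G)x})."""
    e_fwd = 0.5 * k * k           # kinetic energy of e^{ikx}
    e_ref = 0.5 * (k - G) ** 2    # kinetic energy of e^{i(k-G)x}
    mean = 0.5 * (e_fwd + e_ref)
    delta = 0.5 * (e_fwd - e_ref)
    root = math.hypot(delta, V0)
    energy = mean - root          # lower eigenvalue of [[e_fwd, V0], [V0, e_ref]]
    weight_ref = 0.5 * (1.0 + delta / root)
    return energy, weight_ref


def inv_mass_eq12(k, h=1e-4):
    """m/m* from Eq. (12): only the component with reciprocal vector -G contributes."""
    d_weight = (lower_band(k + h)[1] - lower_band(k - h)[1]) / (2.0 * h)
    return 1.0 + (-G) * d_weight


def inv_mass_eq1(k, h=1e-4):
    """m/m* from Eq. (1): second derivative of the band energy W(k)."""
    w_plus, w_mid, w_minus = (lower_band(x)[0] for x in (k + h, k, k - h))
    return (w_plus - 2.0 * w_mid + w_minus) / (h * h)


for frac in (0.1, 0.3, 0.45, 0.5):
    k = frac * G
    print(f"k = {frac:.2f} G: Eq.(12) -> {inv_mass_eq12(k):9.4f}, "
          f"Eq.(1) -> {inv_mass_eq1(k):9.4f}")
```

Under these assumptions the two columns should agree to within finite-difference error, with $m/m^*$ close to 1 near the bottom of the band and negative at the zone boundary $k = G/2$, where the reflected weight changes most rapidly with $k$.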
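FIG. 1. Band structure arising from the perturbation
$2V_0\cos Gx$. The eigenfunctions $\psi_k$ may be written approximately
as linear combinations of $e^{ikx}$ and $e^{i(k-G)x}$, with
relative amplitudes depending on $k$. (Axes: $W$ vs $k$.)

We may derive an interesting expression for
the effective mass in the tight-binding approximation.
We write for the Bloch function

$$\psi_{\mathbf{k}}(\mathbf{r}) = \sum_j e^{i\mathbf{k}\cdot\mathbf{r}_j}\,\phi(\mathbf{r}-\mathbf{r}_j). \tag{16}$$

We suppose that $\phi(\mathbf{r}-\mathbf{r}_j)$ is a nondegenerate
atomic-type (Wannier) function centered about
$\mathbf{r}_j$ and orthogonal to the $\phi$’s centered about other
lattice points. Then it is easy to show that

$$|a_{\mathbf{G}}(\mathbf{k})|^2 = \frac{N}{\Omega}\,|\phi(\mathbf{k}+\mathbf{G})|^2, \tag{17}$$

where

$$\phi(\mathbf{k}+\mathbf{G}) = \int \exp[-i(\mathbf{k}+\mathbf{G})\cdot\mathbf{r}]\,\phi(\mathbf{r})\,d\tau. \tag{18}$$

Here $N$ is the number of atoms in the volume $\Omega$.
Thus,

$$\left(\frac{m}{m^*}\right)_{ij} = \delta_{ij} + \frac{N}{\Omega}\sum_{\mathbf{G}} G_i \frac{\partial}{\partial k_j}\,|\phi(\mathbf{k}+\mathbf{G})|^2; \tag{19}$$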

this equation relates the effective mass to the 
Fourier components of the atomic function ¢. 

I am indebted to Professor J. M. Luttinger for 
helpful conversations. This research has been 
assisted in part by the U. S. Office of Naval 
Research, and the U. S. Signal Corps. 


Iowa Colloquium of College Physicists 


The dates of the Annual Colloquium of College Physi- 
cists at the University of Iowa are June 16-19, 1954. 

As is traditional with the Colloquium, there will be 
emphasis on the exhibit of new devices, reports of research, 
and round tables on teaching problems. Special consideration
will be given Radio Astronomy and the Application
of Atomic Power. The program will be climaxed by the
four lectures of DEAN J. H. VAN VLECK of the Division of
Applied Science at Harvard, on Radio and Microwave 
Spectroscopy of the Solid State. 





A Note on the Study of Uniformly Accelerated Motion 


Henry S. C. CHEN 
Drexel Institute of Technology, Philadelphia, Pennsylvania 


(Received October 30, 1953) 


A method to analyze the experimental data is suggested, in which no assumption is made 
beforehand that the motion is uniformly accelerated. Instantaneous velocities are obtained from 
the limits approached by the average velocities, and the linearity of the velocity vs time curve proves
the constancy of the acceleration.


IN the experiment on uniformly accelerated
motion, a spark timer, or a tuning fork, is
generally used to mark out the different positions
at different times. Beginning at a mark somewhere
after the starting point, when the motion
has become steady, the students measure the
total distances traveled at different instants of
time. First differences are then taken; they are
the average velocities for the successive unit time
intervals. Thus the first value is the average
velocity for the interval t=0→1, the next value
is the average velocity for the interval t=1→2,
etc.

After the student gets these values for the 
average velocities, he is told to do one of two
things. He might be asked to take the second 
differences, which turn out to be nearly constant. 
Using the equation for uniformly accelerated 
motion, the instructor then proceeds to show that 
the successive differences of average velocities 
should be identified with the constant accelera- 
tion. Or the student might be told that since the 
motion is uniformly accelerated, the average 
velocity for a certain time interval is equal to the 
instantaneous velocity at the middle of the time 
interval. Thus the average velocity for the interval
t=0→1 is the instantaneous velocity at t=½,
the average velocity for the interval t=1→2 is
the instantaneous velocity at t=1½. Thus if the
average velocities are plotted against t=½, 1½,
2½, etc., a velocity-time curve is obtained, and
the slope of the curve gives the acceleration.
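
The midpoint property used in the second procedure follows in one line from the equation of uniformly accelerated motion. The short derivation below is offered only as an illustration; the symbols $s_0$, $v_0$, and $a$ (initial position, initial velocity, constant acceleration) and the interval end points $t_1$, $t_2$ are notation assumed here.

```latex
\begin{align}
s(t) &= s_0 + v_0 t + \tfrac{1}{2} a t^2, \qquad v(t) = v_0 + a t, \\
\bar{v}_{t_1 \to t_2}
  &= \frac{s(t_2) - s(t_1)}{t_2 - t_1}
   = v_0 + \tfrac{1}{2} a\,(t_1 + t_2)
   = v\!\left(\frac{t_1 + t_2}{2}\right).
\end{align}
```

The identity depends on $a$ being constant, which is exactly the assumption the usual analysis makes before the data have been examined.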

As a means to determine the value of the 
constant acceleration, both procedures serve 
equally well. But as an experiment to study the 
motion in its fundamentals, both methods leave 
much to be desired. It seems to the writer that 
in order to study a certain problem, it is best 
not to assume the result of the study beforehand. 
Such a process may be necessary in complicated 
cases where one can only prove his point by 
indirect means. But in simple phenomena, it 
seems more convincing and elegant if one delves 
into the problem, without any preconceived ex- 
pectations, and looks for the proper correlations. 
In the experiment on uniformly accelerated 
motion, this can be achieved very easily by the 
following change in the analysis of experimental 
data. 

Starting from a point O, we have measured 
the distance OA at t=1, OB at t=2, OC at 
t=3, etc. Instead of taking first differences, we 


divide OA by 1, OB by 2, OC by 3, etc., and get
the average velocities for time intervals of 1 unit,
2 units, 3 units, etc., starting from the point O.
Thus we have the average velocities beyond the
point O for different sizes of time interval. When
the size of the time interval becomes infinitesimally
small, we arrive at the instantaneous
velocity at O. This is done by plotting average
velocities against the size of the time interval,
and producing the curve to cut the y axis; the
y intercept is then the instantaneous velocity at
O. By using the other points A, B, etc., as the
starting points, we can proceed as before, and
get instantaneous velocities for these points. Thus
we have obtained the instantaneous velocities at
different instants of time. Plotting instantaneous
velocity against time, one can see that the relationship
is linear, and thus show that the
acceleration is constant.

TABLE I. Average velocity for the various time intervals (distance in centimeters; time in arbitrary interval units).

Time in       From O          From A          From B          From C          From D          From E
intervals   Dist. Av.vel.   Dist. Av.vel.   Dist. Av.vel.   Dist. Av.vel.   Dist. Av.vel.   Dist. Av.vel.
0               0               0               0               0               0               0
1            2.57     ...    4.02     ...    5.29     ...    6.85    6.85    8.28    8.28    9.67    9.67
2            6.59     ...    9.31     ...   12.14     ...   15.13    7.56   17.95    8.98   20.82   10.41
3           11.88     ...   16.16     ...   20.42     ...   24.80    8.27   29.10    9.70   33.31   11.10
4           18.73     ...   24.44     ...   30.09     ...   35.95    8.99   41.59   10.40   47.27   11.82
5           27.01     ...   34.11     ...   41.24     ...   48.44    9.69   55.55   11.11   62.72   12.54
6           36.68     ...   45.26     ...   53.73     ...   62.40   10.40   71.00   11.83
7           47.83     ...   57.75     ...   67.69     ...   77.85   11.12
8           60.32     ...   71.71     ...   83.14     ...
9           74.28     ...   87.16     ...
10          89.73     ...

FIG. 1. Average velocity vs size of time interval.

FIG. 2. Instantaneous velocity vs time interval.

Table I shows data taken from the actual 
readings of a student group. The time is meas- 
ured in arbitrary units, and the distance is in 
centimeters. The second and the third columns
give the distances traveled and the average
velocities beyond the first point O, the fourth and
the fifth columns give the same quantities
about the second point A, etc. Figure 1 shows
the average velocities plotted against the size 
of the time interval. The lowest curve is for the 
first point O, the next curve is for the next point 
A, etc. Their y intercepts give the instantaneous 
velocities. Figure 2 shows instantaneous veloci- 
ties plotted against time. The linearity of the 
curve shows that the acceleration is constant, and 
the value of that constant acceleration is found 
to be 1.424 cm/interval/interval. 

In Fig. 1 we have plotted average velocity
against the size of the time interval. The ordinate
is $s/t$ and the abscissa is $t$. Experimentally, we
get $s/t = b + kt$. The intercept has been identified
with $v_0$, so that we can write $s = v_0 t + kt^2$. Comparing
this with $s = v_0 t + \tfrac{1}{2}at^2$, we readily see that
the slope of the lines in Fig. 1 should give half
the acceleration. They turn out to be 0.714,
0.713, 0.725, 0.709, 0.707, and 0.716 for the six
lines. Multiplying by 2 and taking the mean, we
get the acceleration as having the value 1.424
± 0.006 cm/interval/interval, which is to be compared
with 1.424 from the second graph.
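
The analysis described in the last two paragraphs can be repeated numerically from the first distance column of Table I. The sketch below is an illustration, not the author's procedure: it replaces the graphical extrapolation with an ordinary least-squares line (an assumption), and the names `S_FROM_O`, `fit_line`, and `limiting_velocity` are hypothetical.

```python
# Distances (cm) from the first point O at t = 0, 1, ..., 10 intervals (Table I).
S_FROM_O = [0.0, 2.57, 6.59, 11.88, 18.73, 27.01, 36.68, 47.83, 60.32, 74.28, 89.73]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def limiting_velocity(start):
    """Fit s/t against t for intervals that begin at the mark with index `start`.

    Returns (intercept, slope): the intercept estimates the instantaneous
    velocity at that mark, and the slope estimates half the acceleration.
    """
    ts = list(range(1, len(S_FROM_O) - start))
    avg = [(S_FROM_O[start + t] - S_FROM_O[start]) / t for t in ts]
    return fit_line(ts, avg)


starts = list(range(6))                      # the points O, A, B, C, D, E
fits = [limiting_velocity(i) for i in starts]
v_inst = [intercept for intercept, _ in fits]
half_a = [slope for _, slope in fits]

# Acceleration from the velocity-time line (the second graph) ...
_, accel_from_v = fit_line(starts, v_inst)
# ... and from twice the mean slope of the s/t lines (the first graph).
accel_from_slopes = 2.0 * sum(half_a) / len(half_a)

for name, v, k in zip("OABCDE", v_inst, half_a):
    print(f"{name}: v = {v:.3f} cm/interval, slope = {k:.3f}")
print(f"a from v-t line:   {accel_from_v:.3f} cm/interval/interval")
print(f"a from s/t slopes: {accel_from_slopes:.3f} cm/interval/interval")
```

Because a fitted line is not the same as a line drawn by eye, the numbers need not match the quoted slopes digit for digit, but both estimates of the acceleration should fall close to the value of about 1.42 cm/interval/interval given in the text.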

As a matter of fact, this method is not limited 
to the study of uniformly accelerated motion. It 
works equally well in other types of motion. It is 
hoped that its application to simple harmonic 
motion will be dealt with in a later paper. It is 
to be noted that here is one experiment in which 
the student sees the mathematical concept of the 
limit being applied in physical phenomena. 





Valence Bond Calculations 


L. A. ScHMID 
Michigan State College, East Lansing, Michigan 


(Received November 19, 1953) 


The nature of the binding in molecular and ionic crystals and in metals is briefly discussed 
from an intuitive point of view. The salient features of the Slater-Pauling theory of the valence 
bond are then developed by generalizing the results of the Heitler-London treatment of the 
hydrogen molecule. The application of the theory to the calculation of the cohesive energy of 
diamond, the classic example of a valence crystal, is then discussed. 


INTRODUCTION 


JUST as the calculation of the energy levels of
the hydrogen atom may be regarded as the 
fundamental problem of theoretical spectros- 
copy, the calculation of cohesive energy, which 
is just the energy difference of the atoms in the 
free and solid states, for the various types of 
solids constitutes the fundamental problem of 
theoretical solid-state physics. Cohesive energy 
calculations yield information concerning the 
distribution of electrons within the solid and the 
nature of the forces which bind the atoms of the 
solid together. However, judged by the standards 
of accuracy customary in spectroscopic calcu- 
lations, cohesive energy calculations make a poor 
showing indeed. The mathematical difficulties 
involved in a rigorous solution of the problem 
are so staggering—a partial differential equation 
in something of the order of 10” variables is 
involved—that it is necessary to resort to rather 
sweeping assumptions and approximations in 
order to bring the problem within the range of 
what is humanly possible. Even the simplified 
version of the problem is often of such mathe- 
matical complexity that the calculations can be 
carried out only to a rather rough approximation. 
Thus it should not be at all surprising that the 
accuracy of the calculated cohesive energy is 
highly questionable. 

Unless we appreciate that the calculated 
energy is not an end in itself, but rather just a 
means to an end, we would be prone to view the 
work done in this field with a degree of skepticism. 
The actual goal of such a calculation is to test 
the theoretical model on which it is based. From 
this point of view, the calculated cohesive energy,
when compared with the observed value of the
cohesive energy, provides a “figure of merit” for
the theoretical model in question. In this way, 
we are able to acquire an intuitive feeling for the 
nature of binding in solids. This intuitive under- 
standing provides the motivation and justifica- 
tion for cohesive energy calculations. 

In this paper we shall be concerned principally 
with the nature of the valence bond, the purest 
illustration of which is found in diamond. As 
early as 1916 Lewis¹ proposed the now familiar
model in which two bonded atoms are thought of 
as sharing two valence electrons between them. 
Essential to this model for explaining the nature 
of the binding of a valence crystal is the assump- 
tion that the cohesive energy is associated only 
with the interaction between paired electrons 
and that the interaction between unpaired elec- 
trons may be neglected. This assumption has 
been almost universally made in the application 
of the Slater-Pauling (SP) theory of the valence 
bond which gives theoretical expression to the 
intuitive ideas of Lewis. It had long been realized 
that the SP theory handled an apparently minor 
point concerning the orthogonality of the electron 
wave functions in a rather cavalier manner, but
the seriousness of this situation was only recently
pointed out by Slater.² The SP theory has since
then been reformulated to correct for this
deficiency³ and, when the revised theory is applied
to the calculation of the cohesive energy of 
diamond, it is found that the familiar concept of 
sharing electrons needs some revision. 


1 G. N. Lewis, Valence and the Structure of Atoms and
Molecules (Reinhold Publishing Corporation, New York,
1 


2 J. C. Slater, J. Chem. Phys. 19, 220 (1951). 
3 L. A. Schmid, Phys. Rev. 92, 1373 (1953).




NATURE OF BINDING IN CRYSTALS 


In order to put the discussion concerning the 
nature of binding in valence crystals in proper 
perspective, it would be well first to give brief 
descriptions of the nature of the binding in the 
other principal types of crystals. These de- 
scriptions will involve only the most important 
contributions to the cohesive energy. Such 
descriptions would be sorely inadequate if we 
were interested in calculating the cohesive energy 
even to only a rather modest accuracy, but since 
we are interested in ways of picturing the nature 
of the binding in solids, and indeed, since this is 
the major mission of cohesive energy calculations, 
the simplifications are justified. Needless to say, 
however, in establishing the theoretical models 
on which these descriptions are based, the details 
omitted here had to be given considerable 
attention. 

As far as cohesive energy calculations are 
concerned, crystalline solids may be divided into 
four major types: 


1. molecular crystals;
2. ionic crystals;
3. metals;
4. valence crystals.


We shall consider each of these types in order. 


MOLECULAR CRYSTALS⁴


Examples of this crystal type are the solid 
phases of the rare gases He, Ne, etc., as well 
as the solid phases of H₂, O₂, N₂, Cl₂, and I₂.
These crystals may be thought of as an aggregate
of molecules which, by huddling together, find
that they can lower their total energy, and thus 


FIG. 1. Correlation of electron positions in van der
Waals attraction.


4 See, for example, F. Seitz, Modern Theory of Solids
(McGraw-Hill Book Company, Inc., New York, 1940),
pp. 391-394.




prefer the solid state over the gaseous state. 
While in the solid state, however, the individual 
molecules do not lose their identity. The force 
which brings about binding is the van der Waals 
force. While this is in general very difficult to 
calculate, it is quite easy to think about in an 
intuitive way. For purposes of illustration, let 
us consider the van der Waals interaction be- 
tween two monovalent atoms. Each atom con- 
sists of an ionic core outside of which the valence 
electron moves. If the charge density of each 
valence electron were uniformly distributed 
about its core, and if the two atoms were far 
enough apart so that the charge density of one 
valence electron did not overlap the charge 
density of the other, then each atom would be 
completely unaware of the presence of the other 
atom and there would be no attractive force 
between them. However, the valence electrons 
are not charge clouds but rather moving point 
charges. (Recall that, even if the wave function 
for the electron is symmetric about the core, this 
merely means that the probability of finding the 
electronic point charge is independent of its 
angular position, not that the electronic charge is 
uniformly smeared about the core.) Thus each 
atom appears to the other to be an electric dipole 
whose orientation in space is continually chang- 
ing. The interaction between these two rotating 
dipoles is such that their orientations become 
correlated as shown in Fig. 1. Corresponding 
positions of the two valence electrons are indi- 
cated by corresponding numbers. It is apparent 
that the two dipoles always arrange themselves 
so that their mutual energy is a minimum. This 
gives rise to an attractive force. When more than 
one valence electron is present, the argument is 
the same except that now we must think in 
terms of correlations between the orientations of 
higher-order multipoles as well as between
dipoles.
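The correlation argument above can be illustrated with a small numerical sketch. It is not part of the original treatment: it assumes two ideal point dipoles of unit moment at unit separation and uses the classical dipole-dipole energy. The function name `dipole_energy` and the six sample directions are hypothetical choices made only for illustration.

```python
import math

def dipole_energy(p1, p2, separation):
    """Classical energy of two point dipoles: [p1.p2 - 3 (p1.n)(p2.n)] / r**3.

    p1, p2 are dipole vectors; separation is the vector from dipole 1 to dipole 2.
    Units are arbitrary (unit moments, unit distance).
    """
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    r = math.sqrt(dot(separation, separation))
    n = [c / r for c in separation]
    return (dot(p1, p2) - 3.0 * dot(p1, n) * dot(p2, n)) / r**3

R = (1.0, 0.0, 0.0)  # the two atoms lie along the x axis

# Correlated orientations: head-to-tail along the axis, or antiparallel across it.
head_to_tail = dipole_energy((1, 0, 0), (1, 0, 0), R)
antiparallel = dipole_energy((0, 1, 0), (0, -1, 0), R)

# Uncorrelated orientations: average over independent directions of both dipoles.
directions = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
uncorrelated = sum(dipole_energy(p, q, R) for p in directions for q in directions) / 36

print(head_to_tail, antiparallel, uncorrelated)
```

In this sketch the correlated arrangements have negative energy, while the average over independent orientations vanishes, which is the sense in which the attraction depends on the correlation of the two electron positions.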


IONIC CRYSTALS⁵


The familiar example of this class is NaCl. In
this crystal each positive Na⁺ ion has for its
nearest neighbors only negative Cl⁻ ions which
give rise to a potential well centered on the Na⁺
ion and thus lower its energy. The Na⁺ ion in


5 Reference 4, Chap. II and pp. 385-391. 







turn helps create potential wells which lower the 
energy of its neighboring Cl⁻ ions. This accounts
for the attractive force which is necessary if the 
atoms (or ions) are to prefer the solid over the 
gaseous state. However, if only an attractive 
force were present, the ions would continue to 
draw closer to each other until the lattice col- 
lapsed into nothing. A repulsive force which is 
very small for large lattice parameters is needed. 
The elucidation of this repulsive force requires 
quantum mechanics. It is associated with the 
fact that as the ions are crowded together, the 
kinetic energy of the electrons increases. From 
the quantum-mechanical point of view, this in- 
crease in kinetic energy is associated with the 
distortion of the electronic wave functions which 
takes place when the ions begin to overlap. In 
quantum mechanics, the bumpier a wave func- 
tion, the greater the associated kinetic energy. 
Since the distortion in the electronic wave 
functions increases their bumpiness, the kinetic 
energy of the electrons is also increased. From 
the formal point of view of the calculations, this 
distortion in the wave functions arises from the
need to make functions on neighboring atoms
orthogonal to each other. The matter of ortho- 
gonality of neighboring functions will be dis- 
cussed more fully later on. 
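The statement that a bumpier wave function carries more kinetic energy can be checked on a simple model. The sketch below is an illustration only, under the assumption of a particle in a one-dimensional box of unit length in atomic units; the box states and the function name `kinetic_energy` are not taken from the calculations discussed here.

```python
import math

def kinetic_energy(n, length=1.0, steps=20000):
    """Kinetic energy (atomic units) of the box state sqrt(2/L) sin(n pi x / L).

    Evaluates T = (1/2) * integral of |d psi/dx|**2 dx by the midpoint rule.
    """
    dx = length / steps
    k = n * math.pi / length
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        dpsi = math.sqrt(2.0 / length) * k * math.cos(k * x)
        total += 0.5 * dpsi * dpsi * dx
    return total

# Each additional node (more "bumps") raises the kinetic energy.
for n in (1, 2, 3):
    print(n, round(kinetic_energy(n), 4))
```

For these states the kinetic energy grows as the square of the number of half-waves, so doubling the number of bumps quadruples the kinetic energy.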


METALS⁶


As might be expected, the alkali metals with 
only one valence electron per atom have attracted 
the most theoretical attention. An alkali atom 
may be regarded as a compact ion core sur- 
rounded by the extensive valence electron wave 
function. In the solid the ion cores of neighboring 
atoms do not overlap, and so only the valence 
electrons need be subjected to a detailed analysis. 
As the atoms are brought together to form the 
solid, the valence electron wave functions— 
which are commonly called “orbitals,” the term
“wave function” being reserved for the total
electronic wave function of the crystal—spread 
out until they extend throughout the entire 
lattice. The fact that the valence electrons are 

6 See, for example, N. F. Mott and H. Jones, Theory of the
Properties of Metals and Alloys (Oxford University Press,
New York, 1936), Chapters II and IV.


FIG. 2. Potentials of free ions (solid curves) and resultant
potential of ion cores in a metal (dashed curve).


not bound to individual atoms is what gives 
metals their characteristic high electric and 
thermal conductivities. As the valence orbitals 
spread throughout the crystal, they necessarily 
become smoother with a corresponding decrease 
in kinetic energy. This constitutes a major contri- 
bution to the cohesive energy. There is also a 
Coulomb effect which contributes to the binding. 
Referring to Fig. 2, we see that if the solid curves 
represent the potentials of the individual ion 
cores, then the resultant potential of the cores in 
the solid will be given by the dashed curve which 
is just the sum of the two solid curves. It is 
evident that in the region between atoms the 
potential acting on the valence electrons is lower 
than that for the free atoms. This favors binding. 
There is a kinetic energy effect which works 
against binding and which comes into play at 
much larger lattice parameters than would be 
required to make the ion cores overlap with the re- 
sulting distortion of the core orbitals. This effect 
is associated with the Pauli exclusion principle 
which says that only one electron may occupy 
any given state of a system. Because the valence 
orbitals are spread throughout the entire lattice, 
they all occupy states of the same system, namely 
states belonging to the entire crystal. Because of 
the exclusion principle, the electrons are obliged 
to occupy states corresponding to higher kinetic 
energy than the lowest-energy state which they 
would all prefer to occupy. This increase in the 
kinetic energy is called the Fermi energy. For 
the observed lattice parameters of the alkalis, the 
Fermi energy is smaller than the decrease in 
energy resulting from the smoothing of the 
orbitals when they spread, so the net change in 
kinetic energy as the free atoms are brought 
together to form the solid favors binding. 
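The size of the Fermi energy can be estimated from the free-electron model. The following is a minimal sketch under that assumption, which is cruder than the calculations cited above. The value of `r_s_sodium` (radius of the sphere containing one valence electron, in bohr) is an assumed illustrative figure, and the function names are hypothetical.

```python
import math

HARTREE_EV = 27.2114  # one hartree in electron volts

def fermi_energy_ev(r_s):
    """Free-electron Fermi energy in eV for one electron per sphere of radius r_s (bohr)."""
    density = 3.0 / (4.0 * math.pi * r_s**3)            # electrons per cubic bohr
    k_fermi = (3.0 * math.pi**2 * density) ** (1.0 / 3.0)
    return 0.5 * k_fermi**2 * HARTREE_EV

def mean_kinetic_energy_ev(r_s):
    """Average kinetic energy per electron of the filled Fermi sphere, (3/5) E_F."""
    return 0.6 * fermi_energy_ev(r_s)

r_s_sodium = 3.93  # assumed illustrative value, in bohr
print(fermi_energy_ev(r_s_sodium), mean_kinetic_energy_ev(r_s_sodium))
```

Because the Fermi energy in this model varies as the inverse square of `r_s`, it rises rapidly as the lattice is compressed, which is why it works against binding at small lattice parameters.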







VALENCE CRYSTALS 


Diamond is the classic example of this crystal 
type. The Slater-Pauling (SP) theory, which 
gives theoretical expression to the familiar con- 
cept of sharing electrons, is essentially a generali- 
zation of the Heitler-London (HL) method em- 
ployed in calculating the ground-state energy of 
the hydrogen molecule H₂. Before discussing the
HL method and the SP theory, it would be well 
to make some remarks concerning the electronic 
wave functions to be used in calculating the 
energies of molecules and crystals. 


Electronic Wave Functions 


If we let $r_1$ represent the three space coordinates
and single spin coordinate of the first
electron in the H₂ molecule, and $r_2$ the coordinates
of the second electron, then the total
electronic wave function for the molecule has the
form $\psi(r_1, r_2)$. Because of the indistinguishability
of electrons, the charge density of the two electrons,
which is given by $|\psi(r_1, r_2)|^2$, must remain
unchanged if the two electrons are interchanged,
that is,

$$|\psi(r_1, r_2)|^2 = |\psi(r_2, r_1)|^2, \tag{1}$$

which requires that

$$\psi(r_1, r_2) = \pm\psi(r_2, r_1). \tag{2}$$


Whether the plus or minus sign is the appro- 
priate one depends upon the nature of the 
particles being interchanged. For electrons, which 
is the case in which we are interested, the minus 
sign must be used. In this case we say that the 
wave function must be antisymmetric for the 
interchange of the electron coordinates. When 
more than two electrons are present in the mole- 
cule or crystal, we may construct a total elec- 
tronic wave function which will be antisymmetric 
for the interchange of every pair of electron 
coordinates in the following way: Let $\varphi_1, \varphi_2, \ldots,$
$\varphi_N$ be the wave functions or orbitals (including


FIG. 3. Schematic representation of hydrogen molecule.


spin as well as space coordinates) for each of the 
N electrons in the molecule or crystal. Then we 
may express the total electronic wave function 
as a determinant so constructed that the ith 
row involves only $\varphi_i$ and the jth column involves
only orbitals whose arguments are the jth set of 
electron coordinates. Thus 


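$$\psi = \begin{vmatrix}
\varphi_1(1) & \varphi_1(2) & \varphi_1(3) & \cdots & \varphi_1(N) \\
\varphi_2(1) & \varphi_2(2) & \varphi_2(3) & \cdots & \varphi_2(N) \\
\varphi_3(1) & \varphi_3(2) & \varphi_3(3) & \cdots & \varphi_3(N) \\
\vdots & \vdots & \vdots & & \vdots \\
\varphi_N(1) & \varphi_N(2) & \varphi_N(3) & \cdots & \varphi_N(N)
\end{vmatrix}. \tag{3}$$
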
Such a determinant is called a Slater determi- 
nant. We note that interchanging any pair of 
electron coordinates corresponds to interchanging 
two columns of the determinant which necessar- 
ily changes its sign. Thus the wave function is 
antisymmetric for the interchange of any pair of 
electron coordinates. It should be noted that a 
Slater determinant is not the most general form 
for an N-electron wave function because it has
been constructed from the one-electron functions
$\varphi_1, \varphi_2, \ldots, \varphi_N$ instead of being completely arbitrary
in form, subject, of course, to the condition
of antisymmetry. The use of such a wave function 
constitutes the well-known “one-electron ap- 
proximation” which is the starting point for 
almost all work in solid-state and molecular 
theory. 
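The antisymmetry of a Slater determinant can be verified numerically. The sketch below is illustrative only: it assumes three hypothetical one-dimensional orbitals with spin omitted, and it expands the determinant over permutations, which is practical only for a small number of electrons.

```python
import math
from itertools import permutations

def permutation_sign(perm):
    """Sign (+1 or -1) of a permutation given as a tuple of indices."""
    sign = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                sign = -sign
    return sign

def slater_determinant(orbitals, coords):
    """Value of det[orbitals[i](coords[j])], expanded over permutations."""
    n = len(orbitals)
    total = 0.0
    for perm in permutations(range(n)):
        term = permutation_sign(perm)
        for i in range(n):
            term *= orbitals[i](coords[perm[i]])
        total += term
    return total

# Three hypothetical one-dimensional orbitals (spin omitted for brevity).
orbitals = [
    lambda x: math.exp(-x * x),
    lambda x: x * math.exp(-x * x),
    lambda x: (2 * x * x - 1) * math.exp(-x * x),
]

psi = slater_determinant(orbitals, [0.3, -0.7, 1.1])
psi_swapped = slater_determinant(orbitals, [-0.7, 0.3, 1.1])    # electrons 1 and 2 exchanged
psi_coincident = slater_determinant(orbitals, [0.3, 0.3, 1.1])  # two electrons at one point
```

Exchanging two coordinate sets exchanges two columns and reverses the sign, and the function vanishes when two electrons have the same coordinates.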

Hydrogen Molecule


Let us represent the H2 molecule as shown in 
Fig. 3. The circles in the figure around the two 
protons represent the spherically symmetric 
ground-state wave functions of the two hydrogen 
atoms in the free state. Let us represent the space 
parts (spin not included) of these two wave func- 
tions or orbitals by a and b. Let α and β be spin
functions corresponding to spin “up” and spin
“down.” (The two spin functions may be referred
to any direction in space.) 

It is known that the ground state of the H₂
molecule is a singlet state; that is, the spins of its
two electrons are oppositely directed. Two ways
of constructing an electronic wave function from
the orbitals a and b which is antisymmetric for
interchange of electron coordinates and which 
has the spins of the two electrons oppositely 







directed have been used in solving for the ground- 
state energy of the H2 molecule. The first of these, 
the Hund-Mulliken (HM) wave function, has the 
form 


$$\psi_{HM} = [a(1)+b(1)][a(2)+b(2)] \times [\alpha(1)\beta(2) - \beta(1)\alpha(2)]. \tag{4}$$


The second, the Heitler-London (HL) wave 
function, has the form 

$$\psi_{HL} = [a(1)b(2)+b(1)a(2)] \times [\alpha(1)\beta(2) - \beta(1)\alpha(2)]. \tag{5}$$

In the HM method we think in terms of sausage- 
like wave functions a + b which extend over both
atoms of the molecule. The space parts of the two 
electron orbitals are the same but they have 
opposite spins. In the case of the HL method we 
think in terms of orbitals whose space parts a and 
b are localized on different atoms of the molecule 
and whose spins are oppositely directed. Because 
of the indistinguishability of electrons, we must 
not think of the first electron as being localized on 
atom a and the second localized on atom b, or
vice versa, but rather we must regard the two 
situations as being equally likely. It is apparent 
that the HL function satisfies this condition. 

It would be well to remark at this point that we 
can modify the HL function so that instead of 
localizing the electrons on different atoms, we 
localize them on the same atom (taking care to 
show no preference for one atom over the other). 
This may be called the ionic molecule wave 
function. It has the form 


$$\psi_{IM} = [a(1)a(2)+b(1)b(2)] \times [\alpha(1)\beta(2) - \beta(1)\alpha(2)]. \tag{6}$$

Because the energy corresponding to the ionic 
molecule is much higher than that of the ground 
state, this function does not enter into ground- 
state considerations in the usual HL calculation 
for H₂. We shall see, however, that it plays an
important role in our later considerations. 
It should be noted that the HM function may 


be expressed in terms of a single Slater deter- 
minant having the following form: 


$$\begin{vmatrix}
[a(1)+b(1)]\alpha(1) & [a(2)+b(2)]\alpha(2) \\
[a(1)+b(1)]\beta(1) & [a(2)+b(2)]\beta(2)
\end{vmatrix}
= [a(1)+b(1)][a(2)+b(2)] \times [\alpha(1)\beta(2) - \beta(1)\alpha(2)]. \tag{7}$$


The HL function cannot be expressed as a single 
determinant, but rather requires a linear com- 
bination of Slater determinants, 


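$$\begin{vmatrix}
a(1)\alpha(1) & a(2)\alpha(2) \\
b(1)\beta(1) & b(2)\beta(2)
\end{vmatrix}
-
\begin{vmatrix}
a(1)\beta(1) & a(2)\beta(2) \\
b(1)\alpha(1) & b(2)\alpha(2)
\end{vmatrix}
= [a(1)b(2)+b(1)a(2)] \times [\alpha(1)\beta(2) - \beta(1)\alpha(2)]. \tag{8}$$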


In constructing the HL function out of Slater 
determinants, we take the point of view that a 
and b are two bonded or paired orbitals having
opposite spins, but because we cannot specify 
which spin goes with which orbital, we construct 
a determinant for each spin assignment and take 
the wave function to be a linear combination of 
these two determinants. The particular linear 
combination which is identical to the HL func- 
tion is given above. The ionic molecule function 
may be expressed in a similar way if we replace b 
in the first determinant with a, and a in the 
second determinant with b. 
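The determinant identities of Eqs. (7) and (8), and the corresponding one for the ionic molecule function, can be checked numerically. The sketch below takes the linear combination in Eq. (8) to be the difference of the two determinants, which is the combination that reproduces the HL function; the sample numbers are hypothetical and any others would serve.

```python
def det2(m11, m12, m21, m22):
    """Determinant of a 2x2 array."""
    return m11 * m22 - m12 * m21

# Hypothetical sample values of the orbitals and spin functions at the
# coordinates of electrons 1 and 2.
a1, a2 = 0.81, 0.27      # a(1), a(2)
b1, b2 = 0.35, 0.64      # b(1), b(2)
al1, al2 = 0.90, 0.20    # alpha(1), alpha(2)
be1, be2 = 0.10, 0.70    # beta(1), beta(2)

spin_singlet = al1 * be2 - be1 * al2

# Eq. (7): the HM function is a single determinant.
hm_det = det2((a1 + b1) * al1, (a2 + b2) * al2,
              (a1 + b1) * be1, (a2 + b2) * be2)
hm_product = (a1 + b1) * (a2 + b2) * spin_singlet

# Eq. (8): the HL function is a difference of two determinants.
hl_dets = (det2(a1 * al1, a2 * al2, b1 * be1, b2 * be2)
           - det2(a1 * be1, a2 * be2, b1 * al1, b2 * al2))
hl_product = (a1 * b2 + b1 * a2) * spin_singlet

# Ionic function: replace b by a in the first determinant, a by b in the second.
im_dets = (det2(a1 * al1, a2 * al2, a1 * be1, a2 * be2)
           - det2(b1 * be1, b2 * be2, b1 * al1, b2 * al2))
im_product = (a1 * a2 + b1 * b2) * spin_singlet
```

In each case the determinant expression and the factored product agree to rounding error.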

The calculation⁷ for the ground-state energy
of H₂ shows that the binding results principally
from the so-called “exchange term” which, if
designated by K, has the form (in atomic units)

$$K = \frac{S^2}{R_{ab}} - 2S\int \frac{ab}{R_a}\,d\tau + \iint \frac{a(1)b(1)a(2)b(2)}{R_{12}}\,d\tau_1\,d\tau_2, \tag{9}$$


where S is the overlap integral of the space parts
a and b of the electron orbitals,

$$S = \int ab\,d\tau, \tag{10}$$

$R_{ab}$ is the distance between the two nuclei of H₂,
and $R_a$ is the distance between the integration
point of the volume integral in the second term of
K and the nucleus (proton) a. The third integral
is taken over three-dimensional space twice. $R_{12}$
is the distance between the two points entering
into the integration. This integral, which is
called the “exchange integral,” may be thought


7 See, for example, Eyring, Walter, and Kimball,
Quantum Chemistry (John Wiley and Sons, Inc., New York, 
1950), pp. 212-218. 







of as the self-energy of a fictitious charge density 
given by ab. Because it has the form of the self- 
energy of a charge density, it is non-negative. 
Calculation shows that the second term in K 
predominates. Thus K is negative and constitutes 
the main contribution to the cohesive energy. 
Furthermore, the form of K indicates that a 
strong overlap of the bonded orbitals a and b 
favors binding. Indeed, a variational calculation 
based on the HL method shows that these two 
orbitals tend to stretch slightly toward each 
other. It is this fact, namely, that strong overlap 
of bonded orbitals favors binding, which lies 
at the base of the Slater-Pauling theory. 
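The dependence of the exchange term on overlap can be made concrete for hydrogen 1s orbitals. The sketch below is a rough illustration under stated assumptions: it uses the standard closed forms for two 1s orbitals with unit exponent in atomic units, takes the first term of K to be S²/R_ab as in the usual Heitler-London result, and leaves out the two-electron exchange integral because its closed form is lengthy. The function names are hypothetical.

```python
import math

def overlap_1s(r_ab):
    """Overlap integral S of two hydrogen 1s orbitals a distance r_ab apart (bohr)."""
    return math.exp(-r_ab) * (1.0 + r_ab + r_ab**2 / 3.0)

def nuclear_attraction_exchange(r_ab):
    """Integral of a*b/R_a over all space for the same pair of orbitals (hartree)."""
    return math.exp(-r_ab) * (1.0 + r_ab)

def k_without_exchange_integral(r_ab):
    """First two terms of K: S**2 / R_ab - 2 S * integral(a b / R_a).

    The two-electron exchange integral (third term, non-negative) is omitted.
    """
    s = overlap_1s(r_ab)
    return s * s / r_ab - 2.0 * s * nuclear_attraction_exchange(r_ab)

for r_ab in (1.0, 1.4, 2.0, 4.0):
    print(r_ab, round(overlap_1s(r_ab), 4), round(k_without_exchange_integral(r_ab), 4))
```

Both the overlap and the magnitude of the negative second term fall off as the nuclei are separated, in line with the remark that strong overlap favors binding.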


Slater-Pauling Theory 


Let an arbitrary crystal or molecule be repre- 
sented by the two-dimensional lattice in Fig. 4. 
In the SP theory we assume that every valence 
orbital has the form of a lobe localized on an 
atom and pointing toward a neighboring atom 
which is bonded to the one on which the lobe is 
localized. These lobes are represented by the 
solid arrows in Fig. 4. It is evident that each lobe 
is paired off with a lobe of a neighboring atom. 
Paired lobes overlap a great deal, while unpaired 
lobes do not overlap very much. We may con- 
struct the wave function for the crystal or mole- 
cule represented in Fig. 4 by taking the point of 
view used to construct the HL function for H₂
from a linear combination of Slater determinants. 
Paired orbitals must have opposite spins, but 
because we do not know which spin to associate 
with each orbital, we must take the wave func- 
tion to be a linear combination of Slater deter- 


FIG. 4. Schematic representation of binding
in a valence crystal.


minants, one determinant for every possible way 
of assigning spins throughout the crystal. Thus if 
there are N/2 pairs of bonded orbitals in the 
crystal, there will be $2^{N/2}$ determinants in the
linear combination. 
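The count of determinants follows from the two possible spin assignments for each bonded pair. A minimal sketch of the bookkeeping, with hypothetical orbital labels and a hypothetical helper `spin_assignments`:

```python
from itertools import product

def spin_assignments(bonded_pairs):
    """List every way of giving opposite spins to the two orbitals of each bonded pair.

    bonded_pairs: list of (orbital, orbital) labels. Each pair contributes two
    choices, so the result has 2**len(bonded_pairs) entries, one per determinant.
    """
    choices = []
    for first, second in bonded_pairs:
        choices.append([{first: "up", second: "down"}, {first: "down", second: "up"}])
    assignments = []
    for combo in product(*choices):
        merged = {}
        for part in combo:
            merged.update(part)
        assignments.append(merged)
    return assignments

pairs = [("a1", "b1"), ("a2", "b2"), ("a3", "b3")]  # N = 6 orbitals, N/2 = 3 bonds
determinants = spin_assignments(pairs)
print(len(determinants))
```

The number of terms doubles with every additional bond, which is why the full linear combination becomes unmanageable for a crystal.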

The manner of pairing the atoms, and hence 
the orbitals, in the crystal is not unique. Re- 
ferring to Fig. 4, we could take the lobes to be 
pointing in the directions indicated by the dashed 
arrows. Such a pairing of orbitals would give rise 
to a different total electronic wave function. 
Because the bonded atoms in the second (dashed 
arrows) case are farther apart than those in the
first (solid arrows) case, the bonded orbitals 
would overlap less and, arguing by analogy with 
the calculation for H2, the binding would be less, 
so the energy of the crystal using the wave 
function based on the second way of pairing 
atoms and orbitals would be higher than that 
based on the first way. If the energies associated 
with these different wave functions are all close 
together, we must take the actual total electronic 
wave function to be a linear combination of these 
different functions. (The form of this linear 
combination is chosen to minimize the energy of 
the crystal.) This mixing of wave functions to 
form the total electronic wave function is called 
“resonance.” If one way of pairing orbitals gives
much lower energy than all other ways, then it is
not necessary to mix different wave functions and
we say that the bonds are “localized.”

In the case of localized bonds, an energy expression
may be derived⁸ which has the following form:

energy of crystal = kinetic and Coulomb energy
+ exchange integrals between bonded orbitals
− ½ exchange integrals between nonbonded orbitals.

In the usual application of this expression,
it is assumed that, because in the case of H₂
the exchange term is negative and is almost 
entirely responsible for the binding, in the general 
case as well the exchange integrals between 
bonded orbitals are negative and are essentially 
responsible for the entire binding. It is assumed 
that, because nonbonded orbitals do not overlap 
much, their exchange integrals are negligible. 
Finally, it is assumed that the kinetic and 
Coulomb energy are essentially the same for the 
solid as for the free atoms and so do not contrib- 


8 Reference 7, p. 248.







ute to the binding. This gives rise to a bond-by- 
bond description of the binding, inasmuch as the 
binding energy is just the number of bonds 
multiplied by the energy associated with each 
bond. Because the exchange integrals are very 
difficult to evaluate, empirical data is used to 
determine a value for the exchange integral 
arising in a certain variety of crystal or molecule. 
This value is then used to compute the binding 
energy in other molecules or crystals of the same 
variety and apparently consistent results are 
achieved. 
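The bookkeeping behind the bond-by-bond description can be sketched as follows. The coefficient of the nonbonded sum is read here as one-half, and all numerical values are hypothetical, in arbitrary energy units, chosen only to show how the terms combine.

```python
def sp_energy(kinetic_and_coulomb, bonded_exchange, nonbonded_exchange):
    """Energy = Q + sum(bonded exchange integrals) - 1/2 * sum(nonbonded ones)."""
    return (kinetic_and_coulomb
            + sum(bonded_exchange)
            - 0.5 * sum(nonbonded_exchange))

def bond_by_bond_binding(number_of_bonds, exchange_per_bond):
    """Usual approximation: nonbonded integrals and the change in Q are neglected."""
    return number_of_bonds * exchange_per_bond

# Hypothetical numbers: four bonds and six nonbonded pairs.
full = sp_energy(0.0, [-1.5, -1.5, -1.5, -1.5], [-0.1] * 6)
approx = bond_by_bond_binding(4, -1.5)
print(full, approx)
```

The two results differ only through the terms that the usual application discards, so the bond-by-bond picture is as good as the assumption that those terms are negligible.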

Unfortunately, this line of reasoning has 
serious flaws. In order to derive the energy 
expression cited for the case of an arbitrary 
molecule or crystal, it is necessary to assume 
that the space parts of all the orbitals are 
mutually orthogonal, that is, their overlap 
integrals are zero. This will be true for distant 
orbitals because they do not overlap, but it will 
not be true for neighboring orbitals unless special 
pains are taken to make the space part of every 
orbital fluctuate in sign as a function of position 
so that the product of the space parts of two 
neighboring orbitals fluctuates in sign in such a 
way that its volume integral is zero. Such an 
alteration of the orbitals is called an “orthogonalization.”
Methods for accomplishing this
have been worked out.⁹
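For two orbitals, one way of carrying out such an orthogonalization is the symmetric scheme, in which each new orbital is mostly one of the old ones with a small negative admixture of the other. The sketch below assumes this scheme and normalized orbitals a and b with overlap S; the formulas are not given in the text above, and the function names are hypothetical.

```python
import math

def symmetric_coefficients(s):
    """Coefficients (c1, c2) with A = c1*a + c2*b and B = c2*a + c1*b."""
    plus = 1.0 / math.sqrt(1.0 + s)
    minus = 1.0 / math.sqrt(1.0 - s)
    return 0.5 * (plus + minus), 0.5 * (plus - minus)

def overlaps_after(s):
    """Return (<A|A>, <A|B>) using <a|a> = <b|b> = 1 and <a|b> = s."""
    c1, c2 = symmetric_coefficients(s)
    norm = c1 * c1 + c2 * c2 + 2.0 * c1 * c2 * s
    cross = 2.0 * c1 * c2 + (c1 * c1 + c2 * c2) * s
    return norm, cross

s = 0.75  # roughly the 1s-1s overlap near the equilibrium separation of H2
c1, c2 = symmetric_coefficients(s)
print(c1, c2, overlaps_after(s))
```

The coefficient `c2` is negative, so A changes sign in the neighborhood of atom b. This is the fluctuation in sign that makes the overlap integral vanish, and it is also the source of the added bumpiness.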

Referring to the expression for the exchange 
term resulting from the HL treatment of H₂, we
see that for the case of orthogonal orbitals 
(S=0) the exchange term reduces to the ex- 
change integral which is positive and so cannot 
even contribute to the binding, let alone be the 
principal cause of binding. Moreover, the orthog- 
onalization of the orbitals will make them 
considerably bumpier and will thus increase the 
kinetic energy, so that this may not be neglected 
as is usually done. Finally, it is not at all apparent 
that the Coulomb interaction between atoms 
may be neglected as we imply if we assume that 
the Coulomb energy in the solid is essentially 
equal to that of the free atoms. 

The seriousness of these flaws in the usual 
formulation of the SP theory has been
emphasized² by applying the SP theory to the
calculation of the ground-state energy of H₂. This


9 P. O. Löwdin, J. Chem. Phys. 18, 365 (1950).


amounts to using a HL function built on orthog- 
onal, rather than nonorthogonal orbitals, as in 
the case of the original HL calculation discussed 
previously. It was found that the HL function 
built on orthogonal orbitals not only yields no 
binding, but actually gives an energy for H₂
considerably higher than the energy of two free
hydrogen atoms. If the calculation were taken
seriously, it would mean that the H₂ molecule
could not exist since the hydrogen atoms would 
prefer to remain in the free state. 

We may achieve an insight into the reason for 
the failure of this calculation by expressing the 
HL function using orthogonal orbitals in terms 
of the nonorthogonal orbitals a and b. If we let 
A and B be the orthogonalized versions of a and b,
then if we concern ourselves only with the space
parts of the wave functions [the spin parts are
all the same as the spin part of the HM and HL
functions given in Eqs. (4) and (5)], we find²


$$A(1)B(2) + B(1)A(2) = \frac{1}{1-S^2}\Big\{[a(1)b(2) + b(1)a(2)] - S[a(1)a(2) + b(1)b(2)]\Big\}. \tag{11}$$


Thus, from the point of view of nonorthogonal 
orbitals, the wave function we are using is a 
mixture of the HL function, which gives an 
energy close to the observed energy of H₂, and
the ionic molecule function which corresponds 
to a much higher energy. Hence, it is not surpris- 
ing that the HL function built on orthogonal 
orbitals yields an energy greater than the energy 
of two free hydrogen atoms. The way to correct 
for this when we use orthogonal orbitals is to 
take as our wave function not the simple HL 
function, but rather a linear combination of the 
HL function and the ionic molecule function. 
Thus the electronic wave function for H₂ would
have the form

$$\psi = [A(1)B(2) + B(1)A(2)] + \theta[A(1)A(2) + B(1)B(2)], \tag{12}$$

where $\theta$ is a parameter which is adjusted to
minimize the energy. When this is done very 
satisfactory results for the cohesive energy are 
obtained. 
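The effect of the mixing parameter in Eq. (12) can be seen by expanding that function in terms of the nonorthogonal orbitals a and b. The sketch below assumes the symmetric orthogonalization underlying Eq. (11), for which the ionic combination of A and B expands in the same way with the roles of the two brackets exchanged. The variable `theta` stands for the mixing parameter, and the function name is hypothetical.

```python
def mixture_in_nonorthogonal_orbitals(theta, s):
    """Weights of the HL and ionic functions of a, b contained in Eq. (12).

    Returns (hl_weight, ionic_weight) such that
    psi = hl_weight * [a(1)b(2) + b(1)a(2)] + ionic_weight * [a(1)a(2) + b(1)b(2)].
    """
    factor = 1.0 / (1.0 - s * s)
    return factor * (1.0 - theta * s), factor * (theta - s)

s = 0.75
print(mixture_in_nonorthogonal_orbitals(0.0, s))  # HL function of A and B alone
print(mixture_in_nonorthogonal_orbitals(s, s))    # ionic part cancelled
print(mixture_in_nonorthogonal_orbitals(1.0, s))  # equal weights, HM-like
```

With the parameter set to zero the function carries an ionic part of weight proportional to −S. Choosing the parameter equal to S removes the ionic part altogether and recovers the original HL function, and a value of unity gives equal weights, as in the HM function.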


At this point it would be well to make some 





FIG. 5. Schematic representation of binding in diamond.


remarks concerning the Hund-Mulliken (HM) 
method for calculating the ground-state energy 
of H₂. It has been found that the HM function
built on nonorthogonal orbitals yields a value for
the cohesive energy of H₂ which is about as
satisfactory as the result obtained using the
HL function built on nonorthogonal orbitals.
Furthermore, since A and B may be expressed as
linear combinations of a and b,² and because the
HM function may be expressed as a single Slater 
determinant, the HM function built on orthog- 
onal orbitals is identical with that built on non- 
orthogonal orbitals. This follows from the fact 


that we may add together rows of a determinant 
to form new rows without changing the value of 
the determinant. Thus, in contrast to the situa- 
tion which prevails in the HL case, in the HM 
case we may construct the wave function using 
orthogonal, instead of nonorthogonal, orbitals, 
and still obtain satisfactory results. 
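The invariance of the HM function can be checked directly. The sketch below again assumes the symmetric orthogonalization of two normalized orbitals with overlap S, and the sample orbital values are hypothetical. It shows that the HM space function built on A and B differs from that built on a and b only by a constant factor, which disappears on normalization.

```python
import math

def orthogonalized(a_value, b_value, s):
    """Values of A and B at a point, from the values of a and b there."""
    plus = 1.0 / math.sqrt(1.0 + s)
    minus = 1.0 / math.sqrt(1.0 - s)
    c1, c2 = 0.5 * (plus + minus), 0.5 * (plus - minus)
    return c1 * a_value + c2 * b_value, c2 * a_value + c1 * b_value

s = 0.75
a1, b1 = 0.81, 0.35   # hypothetical values of a and b at electron 1
a2, b2 = 0.27, 0.64   # hypothetical values of a and b at electron 2

A1, B1 = orthogonalized(a1, b1, s)
A2, B2 = orthogonalized(a2, b2, s)

hm_orthogonal = (A1 + B1) * (A2 + B2)
hm_nonorthogonal = (a1 + b1) * (a2 + b2)
ratio = hm_orthogonal / hm_nonorthogonal   # the same at every point: 1 / (1 + s)
print(ratio)
```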


Diamond 


A recent calculation³ has shown that the
situation for diamond is completely analogous
to that for H₂. The bonds in diamond may
be regarded as being localized in the manner 
indicated in Fig. 5. Any particular atom of the 
lattice (represented by the point at the center of 
the cube in Fig. 5) is surrounded by four nearest 
neighbors at the four corners of a circumscribed 
regular tetrahedron. The four valence orbitals of 
every atom have the form of lobes which point 
toward the neighboring atoms. These lobes are 
paired with, and overlap, lobes centered on the
neighboring atoms. 

Instead of expressing the total electronic wave 
function as a linear combination of $2^{N/2}$ Slater




determinants built on one-electron functions as 
would be required by the usual formulation of the 
SP theory, the wave function may be constructed 
from the product of N/2 two-electron functions, 
each two-electron function corresponding to a 
bonded pair of lobe-like orbitals. While such a
product is not by itself antisymmetric with
respect to the interchange of pairs of electron
coordinates, it may be used to generate an
antisymmetric wave function in a very simple
manner.³ Any one of these two-electron functions has
the form of the function given in Eq. (12) where
now A and B stand for an orthogonal bonded
lobe-like pair of orbitals. Because there is no
reason why any bonded pair of orbitals should be
treated differently from any other bonded pair,
the parameter $\theta$ is the same for all the
two-electron functions entering into the total
electronic wave function. It may be shown that when
$\theta = 0$ the total electronic wave function built on
two-electron orbitals is identical to that given by
the usual formulation of the SP theory. When
$\theta = +1$, it reduces to a single Slater determinant
of sausage-like functions, each of which extends
over two bonded neighboring atoms. This form
of the wave function is the analog of the HM
function for H₂ since now two bonded orbitals
have identical space parts, which extend over 
both bonded atoms, but have opposite spins. 

It is possible to derive an expression for the 
energy of the lattice in terms of the parameter $\theta$
and sundry matrix elements involving the
orthogonal lobe-like orbitals. These matrix
elements may then be evaluated, yielding an
expression for the energy in terms of the
parameter $\theta$. For $\theta = 0$, which corresponds to the
energy associated with the usual SP wave
function, the energy of diamond is found to be
considerably higher than the energy of the
corresponding number of free carbon atoms,
which is just what happened when the SP theory
was applied to H₂. The value of $\theta$ which
minimizes the energy is $\theta = +0.82$. For this value of
$\theta$ a very satisfactory value for the cohesive
energy is obtained, again in complete analogy
with the corresponding situation in H₂. Finally,
it is found that the energy for $\theta = +1$ differs only
by a negligible amount from the minimum value
for $\theta = +0.82$. Thus the single Slater determinant







of sausage-like orbitals yields a satisfactory 
result for the cohesive energy, just as the analo- 
gous HM function did in the case of the hydrogen 
molecule. 

The single Slater determinant of sausage-like 
orbitals provides the most visualizable way of 
thinking of the nature of the binding in diamond. 
While there are numerous contributions to the 
energy, if we limit ourselves only to the largest 
ones, we arrive at the following explanation of 
the binding: The need to orthogonalize the or- 
bitals as the atoms are brought together to form 
the solid makes them bumpier with a resultant 
increase in kinetic energy. This is partially offset 
by the smoothing which results when the orbitals 
spread out over two atoms. A complicated change 
in the Coulomb energy involving the interaction 
between neighboring bonded and nonbonded 


sausage-like orbitals contributes a negative en- 
ergy which accounts for the binding. The man- 
ner in which bonded sausage-like orbitals are 
paired off with opposite spins constitutes the way 
in which the familiar concept of the valence bond 
regarded as shared electrons makes its way into 
the theory. However, because of the importance 
of the Coulomb interaction between neighboring 
nonbonded sausage-like functions, the bond-by- 
bond description of binding in valence crystals 
and molecules, which neglects this interaction, 
would appear to lose its validity. In partial com- 
pensation for this loss, however, we have the 
satisfaction of knowing that the binding can be 
explained in terms of the familiar Coulomb and 
kinetic energies without the need to assign a 
major role to the intuitively mysterious ex- 
change energy. 


Techniques and Applications of Neutron Diffraction* 


M. K. WILKINSON 
Oak Ridge National Laboratory, Oak Ridge, Tennessee 


(Received October 15, 1953) 


A brief review is given of the advances which have been made in the field of neutron diffrac- 
tion during the past several years. The experimental techniques that have been developed are 
discussed, and illustrations are cited of several fields of application where information is ob- 
tained which would be very difficult or impossible to obtain by x-ray or electron diffraction. 


I. INTRODUCTION 


THE contributions which have been made to
our knowledge of atomic, molecular, and
crystalline structure by x-ray diffraction studies 
form one of the most important phases in the 
progress of modern physics. Additional contribu- 
tions have been obtained from the newer field 
of electron diffraction. A third field, neutron dif- 
fraction, has now reached the stage where it 
can be applied to similar problems. While the 
diffractive properties of neutrons had been 
demonstrated with heterogeneous beams from 
Ra-Be sources as early as 1936,¹,² the low
* This article contains essentially the material which
was presented in a paper at the Colloquium of College
Physicists, held at the State University of Iowa, June 17
to 20, 1953.

1 H. v. Halban and P. Preiswerk, Compt. rend. 203, 73 (1936).
2 D. P. Mitchell and P. N. Powers, Phys. Rev. 50, 486 (1936).


intensity of the neutron beams did not permit 
work on practical problems of application. How- 
ever, with the development of nuclear reactors 
during the past several years, intense beams of 
neutrons are now available which make it 
possible to obtain quantitative measurements of 
neutron diffraction effects. Although the neutron 
beams which are obtained from nuclear reactors 
have a lower intensity than corresponding beams 
from a good x-ray tube, most of the techniques 
of x-ray diffraction can be used with neutrons. 
The purpose of this article is to review briefly 
some of the advances which have been made in 
the field of neutron diffraction during the past 
several years. The material which will be pre- 
sented can be divided logically into three sec- 
tions: (1) a comparison of neutron diffraction 
with the older fields of x-ray and electron dif- 
fraction; (2) a description of the experimental 







techniques which have been developed in neu- 
tron diffraction; and (3) a discussion of some of 
the types of applications where additional, or 
perhaps unique, information is obtained by this 
method. 

For a comprehensive review on the subject, 
readers are referred to an excellent article 
written recently by Bacon and Lonsdale.³


II. COMPARISON OF NEUTRON DIFFRACTION 
WITH X-RAY AND ELECTRON DIFFRACTION 


While photons, neutrons, and electrons have 
widely different physical characteristics, each 
has a wavelength which depends on the energy 
of the particle. In order for a diffraction experiment
to be possible, the wavelength of the
radiation must have an appropriate value. In the
study of crystal or molecular structure, the 
wavelengths which are necessary must be com- 
parable to the distance between atoms in the 
crystals or molecules (about an angstrom). Fur- 
thermore, in most diffraction work, it is necessary 
that the radiation be monochromatic. By the 
proper choice of the target material in an x-ray 
tube, it is possible to obtain monochromatic line 
radiation with wavelengths of many values in 
this desirable region, and, for many crystal struc- 
ture studies, it is not necessary to perform addi- 
tional monochromatization on this line radiation. 
In electron diffraction work the electron beam 
is obtained by direct acceleration through elec- 
tric potentials, and the source of electrons is 
monochromatic within the limits of stability 
of the applied potential. In contrast, there is no 
easy method of obtaining a monochromatic 
beam of neutrons. While the neutrons which are 
produced from the fission process in a nuclear 
reactor have wavelengths much too small for 
crystal diffraction purposes, these neutrons are 
slowed down until they are approximately in 
thermal equilibrium with the atoms in the 
moderating material. The distribution of neutron 
wavelengths is essentially Maxwellian and for- 
tunately occurs in the proper region for diffrac- 
tion experiments with a maximum at about one 
angstrom. It is unfortunate, however, that the 
neutrons are not monochromatic, and a wave- 
length slice must be picked out of the continuous 


3 G. E. Bacon and K. Lonsdale, Repts. Progr. Phys. 16, 1 (1953).


distribution. Such a narrow band of wavelengths 
can be obtained either by a mechanical velocity 
selector or by reflection from a single crystal, 
but the use of these monochromators reduces the 
intensity in the monochromatic beams to only a 
few percent of the original intensity. Further- 
more, the beam is not monochromatic in the true 
sense of the word, but possesses a finite spread 
of wavelengths, which for the monochromators 
normally used is perhaps 0.1 angstrom. 
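The wavelength comparison in this section can be checked numerically. The following is a minimal sketch, not part of the original article: it evaluates the de Broglie relation for a neutron and a (nonrelativistic) electron and the photon relation for x-rays. The function names and the sample energies are illustrative assumptions; the physical constants are the standard SI values.

```python
import math

# Standard SI constants
PLANCK = 6.62607015e-34         # J s
NEUTRON_MASS = 1.67492749804e-27  # kg
ELECTRON_MASS = 9.1093837015e-31  # kg
LIGHT_SPEED = 299792458.0       # m/s
BOLTZMANN = 1.380649e-23        # J/K
JOULE_PER_EV = 1.602176634e-19
METER_PER_ANGSTROM = 1e-10


def matter_wavelength_angstrom(mass_kg, energy_ev):
    """Nonrelativistic de Broglie wavelength h / sqrt(2 m E), in angstroms."""
    momentum = math.sqrt(2.0 * mass_kg * energy_ev * JOULE_PER_EV)
    return PLANCK / momentum / METER_PER_ANGSTROM


def neutron_wavelength_angstrom(energy_ev):
    return matter_wavelength_angstrom(NEUTRON_MASS, energy_ev)


def electron_wavelength_angstrom(energy_ev):
    return matter_wavelength_angstrom(ELECTRON_MASS, energy_ev)


def photon_wavelength_angstrom(energy_ev):
    """Photon wavelength h c / E, in angstroms."""
    return PLANCK * LIGHT_SPEED / (energy_ev * JOULE_PER_EV) / METER_PER_ANGSTROM


def thermal_energy_ev(temperature_k):
    """Characteristic thermal energy kT, in electron volts."""
    return BOLTZMANN * temperature_k / JOULE_PER_EV


if __name__ == "__main__":
    # Illustrative energies (assumptions): a room-temperature neutron,
    # an 8-keV x-ray photon, and a 150-eV electron.
    print(neutron_wavelength_angstrom(thermal_energy_ev(300.0)))
    print(photon_wavelength_angstrom(8048.0))
    print(electron_wavelength_angstrom(150.0))
```

All three cases come out at roughly an angstrom, even though the particle energies differ by more than five orders of magnitude.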
Although diffraction patterns are obtained 
with all three types of radiation, the nature of 
the interaction of x-rays, electrons, and neutrons 
with individual atoms and nuclei is quite dif- 
ferent. The scattering of x-rays by atoms takes 
place almost entirely from the electrons in the 
atom, and there is negligible scattering from the 
nucleus. Since the electrons are distributed 
within the atom at distances which are com- 
parable to the wavelength of the x-rays being 
scattered, interference effects occur which reduce 
the intensity of scattered radiation at large 
angles, and x-ray scattering is said to exhibit a 
form factor decline in intensity. For electron 
radiation, scattering also takes place from the 
electronic cloud within the atom, but in this 
case, nuclear scattering is no longer negligible. 
Hence, additional interference effects are found, 
and the form factor for the scattering of elec- 
trons is even more pronounced than it is for 
x-rays. On the other hand, neutron scattering is 


TABLE I. Neutron and x-ray coherent scattering cross
sections for several elements and isotopes.

Element or isotope    X-ray (barns)    Neutrons (barns)

X-ray column, in row order: 0.66, 0.66, 42.5, 80, 112, 130, 351, 382,
415, 448, 484, 520, 520, 520, 520, 1160, 4460, 5610.
[The element labels and the neutron column are not legible in this copy.]

a The neutron coherent scattering amplitudes for these elements (or
isotopes) are negative.







normally accomplished by short-range nuclear 
forces, and with one exception (which will be 
discussed later) there is a negligible neutron- 
electron interaction. Therefore, this is a scatter- 
ing process in which the wavelength of the radia- 
tion is much larger than the scattering center, 
and the scattering is isotropic. 

Since the scattering mechanism is different 
for the three types of radiation, it is not sur- 
prising to find that the relative scattering powers 
of the various atoms are also different. Table I 
gives a comparison of the coherent scattering 
cross sections of a number of elements for neu- 
trons and x-rays. These cross sections are pro- 
portional to the squares of the coherent scatter- 
ing amplitudes, and the latter determine the 
intensity of the radiation which is scattered at 
various angles by Bragg reflection. Hence, the 
values in Table I represent the relative efficien- 
cies of the various atoms for scattering neutrons 
and x-rays. It is seen that there is a regular varia- 
tion in the x-ray values, and in fact, the x-ray 
cross sections are proportional to the square of 
the atomic number. In contrast, there is no 
regularity to the values of neutron cross sections. 
In general, light elements scatter as effectively 
as the heavy ones, and neighboring atoms on the 
periodic table may scatter quite differently. For 
neutrons of nearly thermal energies, the indi- 
vidual nuclear scattering processes will be elastic, 
since nuclear energy level spacings are large 
compared to kT, and the cross sections will be 
essentially constant over the thermal region 
except when a resonance is found in this region. 
However, the actual value of the cross section 
is dependent on the presence of resonance levels 
which occur at other energies. When the scatter- 
ing element possesses more than one isotope, no 
effect is observed in the x-ray scattering since 
the electronic structure in the different isotopes 
is closely the same, but the neutron scattering 
properties of the isotopes may be quite different. 
The neutron coherent scattering cross section 
for the element will correspond to a weighted 
average cross section for the isotopes, and a 
crystal containing several isotopes will produce a 
certain amount of diffuse neutron scattering 
analogous to the usual disorder scattering en- 


countered in x-ray investigations of disordered 
alloys. 
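Two statements in the preceding paragraph lend themselves to a short numerical sketch. The first function assumes that the x-ray coherent cross section is the single-electron Thomson value multiplied by Z², which appears consistent with the 0.66-barn entry that opens the x-ray column of Table I. The second function forms the weighted average over isotopes and the accompanying diffuse (isotope-incoherent) term; it ignores nuclear-spin incoherence, and the isotope fractions and amplitudes in the demonstration are hypothetical values chosen only for illustration.

```python
import math

CLASSICAL_ELECTRON_RADIUS_CM = 2.8179403262e-13
CM2_PER_BARN = 1e-24
FM2_PER_BARN = 100.0  # 1 barn = 100 fm^2


def xray_coherent_barns(atomic_number):
    """Z^2 times the Thomson cross section (8*pi/3) r_e^2, in barns."""
    thomson = (8.0 * math.pi / 3.0) * CLASSICAL_ELECTRON_RADIUS_CM ** 2
    return atomic_number ** 2 * thomson / CM2_PER_BARN


def isotope_mixture_barns(fractions, amplitudes_fm):
    """Return (coherent, isotope-incoherent) cross sections in barns.

    coherent   = 4*pi*<b>^2
    incoherent = 4*pi*(<b^2> - <b>^2)
    where the averages are weighted by isotopic abundance.
    """
    mean_b = sum(c * b for c, b in zip(fractions, amplitudes_fm))
    mean_b2 = sum(c * b * b for c, b in zip(fractions, amplitudes_fm))
    coherent = 4.0 * math.pi * mean_b ** 2 / FM2_PER_BARN
    incoherent = 4.0 * math.pi * (mean_b2 - mean_b ** 2) / FM2_PER_BARN
    return coherent, incoherent


if __name__ == "__main__":
    for z in (1, 8, 28, 92):
        print(z, round(xray_coherent_barns(z), 1))
    # Hypothetical two-isotope element: 70% with b = +10 fm, 30% with b = -5 fm
    print(isotope_mixture_barns([0.7, 0.3], [10.0, -5.0]))
```

A single-isotope element gives zero for the second term, which is why the diffuse contribution described above appears only when isotopes with different amplitudes share the same lattice sites.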


FIG. 1. Schematic diagram of a typical neutron
diffraction spectrometer. Labeled parts: reactor shield, Pb monochromating
crystal, shielding, monitor counter, scattering specimen, counter.


It was mentioned earlier that with one excep- 
tion neutron scattering is a consequence of the 
interaction between the neutron and the atomic 
nucleus. This exception occurs when the elec- 
tronic configuration of the atom imparts a per- 
manent magnetic moment to the atom. In these 
cases, such as the ferromagnetic elements of the 
iron group, there are incomplete electron shells 
within the atom which are responsible for the 
magnetic behavior of the material. Since the 
neutron also possesses a magnetic moment, there 
will be an interaction between these two mag- 
netic moments, and the total scattering will be 
a superposition of the magnetic and nuclear 
contributions. The intensity of the magnetic 
scattering is comparable to that of the purely 
nuclear scattering and in some cases even ex- 
ceeds it, but in contrast to the nuclear scattering, 
the magnetic scattering of neutrons has a form 
factor dependence. This form factor results be- 
cause the electrons which are responsible for the 
atomic magnetic moment are distributed within 
the atom at distances comparable to the wave- 
length of the radiation. It is somewhat different 
from the normal x-ray form factor, since it is 
representative only of those electrons which 
contribute to the magnetic moment of the atom. 
This magnetic interaction between atoms and 
neutron radiation has no counterpart in x-ray 
scattering, and while there is a magnetic inter- 
action in the scattering of electron radiation, it 
is very small compared to the normal Coulombic 
interaction and has never been observed. 


III. EXPERIMENTAL TECHNIQUES IN
NEUTRON DIFFRACTION 
Most of the techniques which have been de- 
veloped for neutron diffraction experiments have 
required monochromatic radiation. Therefore, 







nearly all neutron diffraction spectrometers are 
similar in principle to the one described by 
Wollan and Shull,⁴ and a schematic diagram of a
typical spectrometer is shown in Fig. 1. The 
heteroenergetic neutrons from the reactor emerge 
through a hole in the reactor shield and strike 
a single-crystal monochromator. A narrow band 
of the continuous spectrum is selected by Bragg 
reflection from the monochromating crystal, and 
this approximately monochromatic beam passes 
through a collimating tube in the shielding 
around the monochromator and strikes the scat- 
tering specimen which is mounted on a con- 
ventional spectrometer. Detection of the scat- 
tered radiation is performed by a neutron counter 
(usually a BF₃ gas proportional counter enriched
in the isotope B¹⁰) which traverses the angle
around the scattering specimen. An additional 
neutron counter is placed in the primary mono- 
chromatic beam as a monitor, so that all data 
can be normalized to a particular neutron in- 
tensity regardless of changes in the power level 
of the reactor. Even though the flux at the 
center of the reactor may be of the order of 10” 
neutrons/cm² sec, the monochromatic beam contains
a neutron current of only about 200 000
neutrons/sec. Single crystals of NaCl, CaCO₃,
Pb, Cu, Be, Ge, and Fe have been used as 
monochromators, and in general the metallic 
crystals give more intense beams, since their 
mosaic spread more nearly matches the geo- 
metrical angular spread of the collimating sys- 
tem. This allows the collimating system to be 
filled more completely with radiation, but it also 
results in somewhat poorer wavelength resolu- 
tion. Of course, for certain specific problems it is 
desirable to use monochromating crystals which 
possess special characteristics such as the use of 
calcite when extremely good resolution is 
necessary. 

Two techniques have been used in the studies 
of crystalline materials: (1) the Debye-Scherrer- 
Hull powder method of analysis; and (2) the 
Bragg scattering from single crystals.⁵,⁶ Each of
these methods has definite advantages, and in 


4 E. O. Wollan and C. G. Shull, Phys. Rev. 73, 830 (1948).
5 S. W. Peterson and H. A. Levy, J. Chem. Phys. 19, 1416 (1951).
6 G. E. Bacon, Proc. Roy. Soc. (London) A209, 397 (1951).


some cases it is advantageous to obtain data 
using both methods. 

The powder method is generally used when 
accurate intensity measurements are desired, 
and of course, this method must be used when 
single crystals of a material are not available. 
The polycrystalline sample, in the form of a 
powder or filings, is placed in either a flat or 
cylindrical cell, depending on the type of experi- 
ment and the amount of material available. In 
general, the flat cells have cross-sectional dimen- 
sions of about 2 in. X 2 in. and a thickness of 
about 3? in., while the cylindrical cells are about 

4 in. high with a 3-in. diameter. Cells are 
usually made from vanadium or aluminum be- 
cause these elements have low values of neutron 
coherent scattering cross sections; and, conse- 
quently, there is only a small scattering con- 
tribution from the cell. Interpretation of the 
diffraction pattern in terms of the scattering 
amplitudes of the atoms in the scattering speci- 
men can be made with the aid of equations which 
have been adapted from corresponding equations 
applicable to x-ray diffraction.⁷ For a flat sample,
the expression is 


$$\frac{P_{hkl}}{P_0} = \frac{\lambda^3 l}{4\pi r}\,\frac{h\rho'}{\rho}\,\frac{e^{-\mu h \sec\theta}}{\sin^2 2\theta}\, j_{hkl}\, N^2 F_{hkl}^2, \tag{1}$$


where $P_{hkl}$ is the total power in an (hkl) reflection
as measured through a counter slit opening of
height $l$ at a distance $r$ from the sample, $P_0$ the
primary beam power, $\lambda$ the neutron wavelength,
$\rho'$ the measured density of the powder, $\rho$ the
density of the solid crystal, $\mu$ the linear attenuation
coefficient, $h$ the thickness of the specimen,
$\theta$ the Bragg angle of reflection, $j_{hkl}$ the number
of planes which contribute to the scattering at
this angle, $N$ the number of unit cells per cm³
of crystal, and $F_{hkl}$ the scattering amplitude per
unit cell for the particular crystal structure.
$F_{hkl}$ is a linear combination of the coherent scattering
amplitudes of the scattering centers in the
crystal, and because of the loss of Bragg scatter- 
ing due to the interaction of the neutrons with 
the lattice vibrations, it also contains a correction 
for the temperature of the sample. A similar 

7 See, for example, A. H. Compton and S. K. Allison,
X-Rays in Theory and Experiment (D. Van Nostrand Company,
Inc., New York, 1946), p. 415.




scattering expression is used when the sample is 
contained in a cylindrical cell. The direct measurement
of the power $P_0$ in the monochromatic
incident beam is a difficult problem, and it is 
more convenient and accurate to compare the 
diffracted intensities of the sample with those 
produced by a substance of known coherent 
scattering cross section. For a monoisotopic 
element with zero nuclear spin, the coherent 
scattering cross section and the total scattering 
cross section are identical, and the latter can be 
obtained from a simple transmission experiment. 
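The flat-sample relation can be turned into a small calculation. This sketch assumes the expression has the form commonly quoted in the neutron powder literature, P_hkl/P_0 = (λ³ l / 4πr)(hρ'/ρ) e^(−μh sec θ) j N² F² / sin² 2θ, with symbols as defined in the text. The function and parameter names are illustrative, and all lengths are taken in centimeters so that the ratio is dimensionless.

```python
import math


def flat_powder_power_ratio(wavelength_cm, slit_height_cm, distance_cm,
                            density_ratio, thickness_cm, mu_per_cm,
                            theta_rad, multiplicity, cells_per_cm3,
                            f_cell_cm):
    """Ratio P_hkl / P_0 for a flat powder sample (assumed form, see lead-in).

    density_ratio is rho'/rho, the packed-powder density over the
    solid-crystal density; f_cell_cm is the scattering amplitude per unit cell.
    """
    geometry = wavelength_cm ** 3 * slit_height_cm / (4.0 * math.pi * distance_cm)
    effective_thickness = thickness_cm * density_ratio
    absorption = math.exp(-mu_per_cm * thickness_cm / math.cos(theta_rad))
    angular = 1.0 / math.sin(2.0 * theta_rad) ** 2
    return (geometry * effective_thickness * absorption * angular
            * multiplicity * cells_per_cm3 ** 2 * f_cell_cm ** 2)


def cell_amplitude_from_ratio(measured_ratio, **setup):
    """Invert the relation for |F_hkl|, given the same setup keywords."""
    unit_response = flat_powder_power_ratio(f_cell_cm=1.0, **setup)
    return math.sqrt(measured_ratio / unit_response)


if __name__ == "__main__":
    # Hypothetical setup, chosen only to exercise the functions.
    setup = dict(wavelength_cm=1.2e-8, slit_height_cm=5.0, distance_cm=50.0,
                 density_ratio=0.5, thickness_cm=0.5, mu_per_cm=0.3,
                 theta_rad=math.radians(16.0), multiplicity=12,
                 cells_per_cm3=3.2e22)
    ratio = flat_powder_power_ratio(f_cell_cm=1.3e-12, **setup)
    print(ratio, cell_amplitude_from_ratio(ratio, **setup))
```

Because P_0 is hard to measure directly, a practical use of the sketch is to take the ratio of two such results, one for the sample and one for a standard of known cross section, so that the beam power cancels.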

A typical powder diffraction pattern is shown 
in Fig. 2 for a sample of molybdenum powder. 
This sample was enclosed in a thin-walled
cylindrical aluminum cell held at the tempera- 
ture of liquid hydrogen and irradiated by neu- 
trons with a wavelength of 1.212 angstroms. 
The widths of the diffraction peaks are perhaps 
three times larger than powder pattern peak 
widths obtainable with present-day x-ray tech- 
niques. This poorer resolution results from the 
less refined geometry which is necessitated by 
the lower intensity available in the neutron ex- 
periments. It is not a serious problem for the 
patterns produced by crystals possessing high 
symmetry where the diffraction lines are well 
separated, but it does become troublesome
for more complex crystals of lower symmetry,
where neighboring reflections may remain unresolved. There are
several small reflections in the molybdenum
pattern which are due to the second-order wavelength
contamination in the "monochromatic"
beam incident on the sample. Under special 
conditions this second-order wavelength radia- 
tion can be removed by filters or the use of 
specific reflecting planes in certain monochromat- 
ing crystals. 
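The positions of the molybdenum reflections, and of the small extra peaks from second-order contamination, follow from Bragg's law. The sketch below is an illustration under stated assumptions: molybdenum is taken as body-centered cubic with a lattice constant of 3.147 angstroms (a room-temperature handbook value, not a figure from this article, and slightly larger than the value at 20°K), and the contamination is modeled as neutrons of half the nominal wavelength.

```python
import math


def cubic_d_spacing(lattice_constant, hkl):
    """Interplanar spacing of a cubic crystal, in the units of lattice_constant."""
    h, k, l = hkl
    return lattice_constant / math.sqrt(h * h + k * k + l * l)


def bcc_allowed(hkl):
    """Body-centered cubic lattices reflect only when h + k + l is even."""
    return sum(hkl) % 2 == 0


def scattering_angle_deg(lattice_constant, hkl, wavelength):
    """Scattering angle 2*theta from Bragg's law, or None if unreachable."""
    sine = wavelength / (2.0 * cubic_d_spacing(lattice_constant, hkl))
    if sine > 1.0:
        return None
    return 2.0 * math.degrees(math.asin(sine))


if __name__ == "__main__":
    a_mo = 3.147        # angstroms; assumed room-temperature value
    wavelength = 1.212  # angstroms, as quoted in the text
    planes = [(1, 1, 0), (2, 0, 0), (2, 1, 1), (2, 2, 0),
              (3, 1, 0), (2, 2, 2), (3, 2, 1)]
    for hkl in planes:
        assert bcc_allowed(hkl)
        primary = scattering_angle_deg(a_mo, hkl, wavelength)
        half = scattering_angle_deg(a_mo, hkl, wavelength / 2.0)
        print(hkl, primary, half)
```

The half-wavelength peaks fall at smaller angles than the corresponding primary reflections, which is where weak extra lines would be expected in a pattern such as Fig. 2.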

The single-crystal technique is generally a 
more useful method for crystallographic work, 
because the resolution of the diffraction peaks is 
considerably better than that obtained with 
powders. However, the effects of extinction and 
crystal distortion make absolute intensity measurements
somewhat less reliable. Extinction⁸
causes the intensity of radiation incident on the 
crystal to decrease more rapidly on passage 
through the crystal than would be predicted 


8 For definitions of primary and secondary extinction
see, for example, reference 7, p. 4


FIG. 2. Neutron diffraction pattern of molybdenum
powder at 20°K (intensity in neutrons/min versus scattering angle).


from the measured value of the attenuation 
coefficient, and in order to prevent these effects 
from appreciably affecting single crystal meas- 
urements, it is necessary that the crystals have 
dimensions of the order of one millimeter. The 
criterion for the maximum thickness of a single 
crystal in neutron work has been given by Bacon 
and Lowde.⁹ Even though the total volume of the
sample is considerably smaller than the volume 
necessary in the powder technique, the peak 
intensities obtained in the two methods are 
comparable. In addition to a resolution which 
is perhaps twice as good in the single-crystal 
method, the smaller sample size also reduces 
the diffuse scattering relative to the intensity 
of Bragg reflections. Interpretation of the data 
can be made by application of the following 
equation which is adapted from the standard 
x-ray expression:¹⁰

$$E_{hkl} = \frac{I_0}{\omega}\,\frac{\lambda^3 N^2 V\,|F_{hkl}|^2}{\sin 2\theta}, \tag{2}$$

where $E_{hkl}$ is the integrated intensity corrected
for absorption in an (hkl) reflection, $I_0$ the incident
neutron intensity, $\lambda$ the neutron wavelength,
$\omega$ the angular velocity of rotation of the
crystal on the spectrometer, $N$ the number of
unit cells per unit volume of the crystal, $V$ the
volume of the scattering specimen, $\theta$ the Bragg
angle of reflection, and $F_{hkl}$ the scattering amplitude
per unit cell of the particular crystal structure.
The value of $I_0$ is established by a comparison
with similar measurements on a standard
single crystal which is known not to be subject
to extinction and has likewise been carefully
calibrated against a powdered sample of a material
with a known coherent scattering cross
section.

9 G. E. Bacon and R. D. Lowde, Acta Cryst. 1, 303 (1948).
10 International Tables for the Determination of Crystal
Structures (J. W. Edwards, Ann Arbor, Michigan, 1944),
revised edition, Vol. 2, p. 560.

There are many interesting problems which 
require the investigation of neutron patterns at 
various temperatures. Unlike x-ray diffraction 
studies, there are no particular problems in the 
design of the necessary auxiliary equipment, 
because neutron radiation can pass through 
vacuum jackets and radiation shields with very 
little loss in intensity. Low-temperature cryo- 
stats have been used to obtain data at tempera- 
tures as low as 4.2°K, while high-temperature 


furnaces have maintained sample temperatures 
up to 1250°K. 


IV. FIELDS OF APPLICATION OF NEUTRON 
DIFFRACTION 


It is only natural that the neutron diffraction 
techniques have been developed around those 




problems where additional or unique results 
could be obtained. These problems can be 
classified into four groups: (1) nuclear physics 
studies; (2) chemical crystallographic problems; 
(3) magnetic scattering phenomena; and (4) 
problems involving the physical structure of 
matter. While the remaining sections of this 
article describe specific examples of the various 
fields of applications, it should be emphasized 
that the descriptions are brief and that the list 
of illustrations is by no means a complete one. 


Nuclear Physics Studies 


Since the scattering of neutron radiation by an 
atom is almost entirely a nuclear scattering 
process except in the special cases where the atom 
possesses a magnetic moment, information is ob- 
tained about the neutron-nuclear interaction. 
This interaction is the result of short-range 
nuclear forces, about which little is known, and 
neutron diffraction experiments are one of the 
many sources contributing information which 
will lead to a clearer understanding of nuclear 
theory. 


FIG. 3. Portions of neutron powder diffraction patterns for isotopically enriched samples of NiO (abscissa: counter angle in degrees).







One of the first major contributions of neutron 
diffraction to nuclear theory was the measure- 
ment of the coherent scattering amplitude of 
hydrogen.¹¹ This measurement was important
since it involved the interaction between two 
fundamental particles and was therefore a prob- 
lem which could receive a thorough theoretical 
treatment. From the value of the coherent 
scattering amplitude it was possible to evaluate 
the scattering amplitudes characteristic of the 
two spin states of scattering. These data and 
earlier experiments on the scattering of slow 
neutrons by ortho- and para-hydrogen showed 
that the interaction of the proton and neutron is 
spin dependent, that is, the short-range force 
which the neutron experiences near a proton is 
strongly dependent on the relative orientation 
of the spins of the two particles. Furthermore, 
it was possible to obtain a value for the range 
of the neutron-proton triplet interaction. The 
most accurate determination of this interaction 
was made recently by Burgy, Ringo, and 
Hughes¹² utilizing total reflection of slow neutrons
from liquid hydrocarbons.
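The step from the coherent amplitude to the two spin-state amplitudes can be sketched as follows. The calculation assumes the standard statistical weights of 3/4 for the triplet and 1/4 for the singlet state, so that the coherent amplitude is the weighted mean of the two amplitudes and the total scattering cross section is 4π times their weighted mean square. The numerical inputs in the demonstration (about 10.8 and −47.4 fm for bound protons) are approximate present-day values introduced here as assumptions; they are not taken from the article.

```python
import math

FM2_PER_BARN = 100.0  # 1 barn = 100 fm^2


def coherent_amplitude_fm(b_triplet, b_singlet):
    """Spin-weighted mean amplitude: (3/4) b_t + (1/4) b_s."""
    return 0.75 * b_triplet + 0.25 * b_singlet


def total_scattering_barns(b_triplet, b_singlet):
    """4*pi times the spin-weighted mean square amplitude."""
    mean_square = 0.75 * b_triplet ** 2 + 0.25 * b_singlet ** 2
    return 4.0 * math.pi * mean_square / FM2_PER_BARN


def spin_state_amplitudes(b_coherent, sigma_total_barns):
    """Solve the two relations above for (b_triplet, b_singlet).

    The equations are quadratic, so two candidate pairs are returned;
    other data (for example ortho- and para-hydrogen scattering) are
    needed to choose between them.
    """
    mean_square = sigma_total_barns * FM2_PER_BARN / (4.0 * math.pi)
    spread = mean_square - b_coherent ** 2
    if spread < 0.0:
        raise ValueError("cross section too small for this coherent amplitude")
    root = math.sqrt(spread / 3.0)
    return [(b_coherent + sign * root, b_coherent - 3.0 * sign * root)
            for sign in (1.0, -1.0)]


if __name__ == "__main__":
    b = coherent_amplitude_fm(10.8, -47.4)
    sigma = total_scattering_barns(10.8, -47.4)
    print(b, sigma)
    print(spin_state_amplitudes(b, sigma))
```

The small, negative coherent amplitude arises from a near cancellation of two much larger amplitudes of opposite sign, which is the spin dependence described in the text.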

Neutron diffraction patterns have been taken 
for well over a hundred elements or compounds, 
and from these measurements scattering cross 
sections and scattering phase data have been 
determined for over sixty elements or separated 
nuclides.¹³ These measurements are of course
necessary in the interpretation of crystallo- 
graphic structure problems, but in addition, they 
have given information of direct interest in 
nuclear physics. For example, both the magni- 
tude of the scattering cross section and the sign 
of the scattering amplitude give information 
concerning the effects of resonance scattering by 
the nucleus. As an illustration of this, consider 
the investigation made on the isotopes of nickel.¹⁴
This study was of interest since the scattering 
data from normal nickel had indicated that the 
scattering properties of the nickel isotopes 
would be widely different. Portions of the dif- 
fraction patterns which were taken for samples 
of NiO enriched in Ni⁵⁸, Ni⁶⁰, and Ni⁶², as compared
with normal NiO, are shown in Fig. 3.
Since NiO has the rocksalt structure, it is very
easy to determine the phase of scattering of the
nickel samples relative to oxygen. All scatter
with the same phase as oxygen except Ni⁶²,
which, as indicated by the very weak (200)
reflection in the Ni⁶²O pattern, has a scattering
amplitude of the opposite sign. Furthermore, it
is clearly seen that the scattering amplitude of
Ni⁵⁸ is very much larger than that of Ni⁶⁰. These
data indicate a virtual resonance for Ni⁵⁸ and
positive energy resonances for Ni⁶⁰ and Ni⁶².

11 Shull, Wollan, Morton, and Davidson, Phys. Rev. 73, 842 (1948).
12 Burgy, Ringo, and Hughes, Phys. Rev. 84, 1160 (1951).
13 C. G. Shull and E. O. Wollan, Phys. Rev. 81, 527 (1951).
14 Koehler, Wollan, and Shull, Phys. Rev. 79, 395 (1950).

FIG. 4. X-ray and neutron powder diffraction
patterns of NaH (intensity versus counter angle).
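The phase argument for the rocksalt structure can be made concrete with a short structure-factor sketch. In this structure, reflections with all-even indices depend on the sum of the two scattering amplitudes and reflections with all-odd indices on their difference, so a weak (200) line beside a strong (111) line points to amplitudes of opposite sign. The amplitudes used in the demonstration are approximate present-day tabulated values in units of 10⁻¹³ cm; they are assumptions introduced for illustration, not values from the article.

```python
def rocksalt_structure_factor(hkl, b_cation, b_anion):
    """Structure factor per unit cell for the rocksalt arrangement.

    Cations on an fcc lattice at the origin, anions displaced by half a
    cell edge. Mixed-parity indices are forbidden; all-even indices give
    4*(b_cation + b_anion); all-odd indices give 4*(b_cation - b_anion).
    """
    h, k, l = hkl
    parities = {h % 2, k % 2, l % 2}
    if len(parities) != 1:
        return 0.0
    if h % 2 == 0:
        return 4.0 * (b_cation + b_anion)
    return 4.0 * (b_cation - b_anion)


if __name__ == "__main__":
    b_oxygen = 5.8  # approximate modern value (assumption)
    nickel = {"Ni58": 14.4, "Ni60": 2.8, "Ni62": -8.7, "normal Ni": 10.3}
    for label, b_ni in nickel.items():
        f111 = rocksalt_structure_factor((1, 1, 1), b_ni, b_oxygen)
        f200 = rocksalt_structure_factor((2, 0, 0), b_ni, b_oxygen)
        # Intensities scale with the square of the structure factor;
        # multiplicity and angular factors are omitted in this sketch.
        print(label, f111 ** 2, f200 ** 2)
```

With these inputs the (200) intensity for the Ni⁶² sample is a small fraction of the (111) intensity, while for the other samples (200) is the stronger of the two, in line with the behavior described for Fig. 3.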


Chemical Crystallographic Problems 


There are certain problems in chemical crystal- 
lography where neutron diffraction can be ap- 
plied more readily than x-ray or electron dif- 
fraction merely because the scattering cross 
sections are more favorable. This type of 
problem, of course, occurs in studies where the 
uniform variation of scattering cross section 
with atomic number as found for both x-ray and 
electron radiation is disadvantageous. 

Structure studies of composite crystals which 
contain both heavy atoms and light atoms fall 
into this classification. In x-ray and electron 
diffraction, the intensities observed in the dif- 
fraction peaks contain such a tremendous con- 
tribution from the heavy atoms that little or no 
information is obtained concerning the positions 
of the light atoms. One of the most important 
problems of this type is the location of hydrogen 
atom positions in crystals. A simple example of 
such a situation is the study of NaH, and Fig. 4 







gives the x-ray and neutron diffraction patterns 
for a polycrystalline sample.¹¹ As listed in Table
I, the x-ray scattering cross section for Na is 
over one hundred times larger than the x-ray 
scattering cross section for H, while for neutron 
radiation, the two cross sections are comparable. 
Hence, the reflections in the x-ray pattern are 
characteristic only of the sodium positions in 
the crystal, whereas the neutron pattern is repre- 
sentative of the scattering from both types of 
atoms. A detailed study of the neutron data 
shows that the structure of NaH is similar to 
that of NaCl and that hydrogen scatters with an 
amplitude which is opposite in sign to that of 
sodium. The very intense diffuse scattering in 
the neutron pattern is a consequence of the spin- 
dependent incoherent scattering from hydrogen, 
and this diffuse scattering has an angular varia- 
tion since the protons in the lattice were not 
rigidly bound relative to the neutron energy 
which was used. For studies where a large diffuse 
background is troublesome, the use of deuterated 
samples may be more feasible or the single 
crystal method of analysis may be used. One of 
the most comprehensive investigations of the 
positions of hydrogen atoms in crystals is the 
study of the phase transformations of the ammonium
halides by Goldschmidt and Hurst¹⁵
and by Levy and Peterson.¹⁶ In addition to their
importance in nuclear physics, problems involv- 
ing the scattering from hydrogen atoms in 
crystals are important in structural chemistry 


FIG. 5. Neutron diffraction pattern of ordered
polycrystalline FeCo (intensity in neutrons/min versus scattering angle).


15 G. H. Goldschmidt and D. G. Hurst, Phys. Rev. 83, 88 (1951); 86, 797 (1952).

16 H. A. Levy and S. W. Peterson, Phys. Rev. 83, 1270 (1951); 86, 766 (1952); J. Am. Chem. Soc. 75, 1536 (1953).




since the locations of hydrogen positions in 
crystals give information concerning single elec- 
tron bonding. 

Neutron diffraction techniques are also more 
favorable in studies of the structure of binary 
alloys which contain atoms with almost the same 
atomic number. When some of these alloys are 
given a suitable heat treatment, they form or- 
dered solid solutions in which one atomic species 
becomes located preferentially with respect to 
the other. Diffraction patterns of these ordered 
structures, or "superlattices," show additional
reflections, the intensities of which depend on 
the degree of order in the sample and the dif- 
ference between the scattering amplitudes of the 
two atoms. The regular variation of x-ray scat- 
tering amplitudes with atomic number makes 
investigations of this type very difficult, since 
the intensities of the superlattice reflections are 
extremely weak regardless of the degree of order, 
while in many of these cases neutron diffraction 
investigations can easily be performed.¹⁷ The
FeCo system is a good example of this type of 
study, and Fig. 5 represents the neutron dif- 
fraction pattern for an ordered sample. The 
(110) and (200) reflections are normal lattice 
reflections which would occur regardless of the 
order in the alloy, while superlattice reflections 
are seen at the (100) and (111) positions. In the 
ordered structure the body-centered cubic unit 
cell contains cobalt atoms at the body-centered 
positions and iron atoms at the corner positions. 
From the neutron data it is quite easy to make 
an accurate determination of the degree of order 
in the alloy, while x-ray investigations must be 
made under special conditions even to establish 
that an ordered structure exists. Investigations 
of order-disorder phenomena in alloys contain- 
ing nearby transition elements are very im- 
portant in solid-state physics, since many of the 
physical and magnetic properties of these ma- 
terials are quite sensitive to the degree of order 
in the sample. 
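The contrast between neutron and x-ray sensitivity to order in FeCo can be estimated with a simple structure-factor sketch. For an equiatomic alloy on a body-centered cubic lattice, reflections with h+k+l even depend on the sum of the two amplitudes and are independent of order, while reflections with h+k+l odd depend on the difference multiplied by the long-range order parameter. The neutron amplitudes used (about 9.45 for Fe and 2.49 for Co, in units of 10⁻¹³ cm) are approximate present-day values introduced as assumptions. For x-rays the atomic numbers stand in for the forward scattering amplitudes; magnetic scattering, form factors, and multiplicities are ignored.

```python
def ordered_bcc_structure_factor(hkl, amp_corner, amp_center, order=1.0):
    """Structure factor for an equiatomic alloy on a bcc lattice.

    order is the long-range order parameter: 1 for complete order
    (one species on corners, the other on body centers), 0 for a
    random solid solution.
    """
    h, k, l = hkl
    if (h + k + l) % 2 == 0:
        return amp_corner + amp_center           # fundamental reflection
    return order * (amp_corner - amp_center)     # superlattice reflection


def superlattice_to_fundamental(amp_a, amp_b, order=1.0):
    """Squared ratio of the (100) to the (110) structure factor."""
    superlattice = ordered_bcc_structure_factor((1, 0, 0), amp_a, amp_b, order)
    fundamental = ordered_bcc_structure_factor((1, 1, 0), amp_a, amp_b, order)
    return (superlattice / fundamental) ** 2


if __name__ == "__main__":
    neutron = superlattice_to_fundamental(9.45, 2.49)   # assumed amplitudes
    xray = superlattice_to_fundamental(26.0, 27.0)      # atomic numbers of Fe, Co
    print(neutron, xray, neutron / xray)
```

Under these assumptions the superlattice lines are hundreds of times stronger, relative to the fundamental lines, for neutrons than for x-rays, and the ratio falls as the square of the order parameter.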

Although the techniques have not been fully 
developed at the present time, it is apparent 
that neutron diffraction will be more favorable 
than x-ray or electron diffraction for successful 
attack on structure studies of liquids and vitreous 


17 C. G. Shull and S. Siegel, Phys. Rev. 75, 1008 (1949).







FIG. 6. X-ray and neutron diffraction patterns of vitreous silica plates. X-ray data of Warren (1936), in electron units; neutron data, taken at two wavelengths (one of them 0.78 A), in counts per minute; abscissa sin θ/λ.


materials. The advantage of neutron diffraction 
in these problems does not necessarily arise from 
more favorable values of scattering amplitudes, 
but from more detailed structure in the diffrac- 
tion pattern particularly at large values of the 
scattering angle. This additional structure is 
mainly due to the difference in the scattering 
interaction of the radiation with the atoms in 
the material. The neutron scattering amplitudes 
are isotropic in contrast to the form factor de- 
pendence which the x-ray and electron scattering 
amplitudes experience. Figure 6 shows the neu- 
tron diffraction pattern for fused silica obtained 
by Milligan, Levy, and Peterson¹⁸ together with 
a similar x-ray pattern obtained by Warren, 
Krutter, and Morningstar.¹⁹ The additional detail 
in the neutron pattern is immediately obvious. 
Whereas the x-ray pattern has one large 
well-defined maximum and then some less re- 
solved weaker peaks, there are several well- 
defined maxima in the neutron pattern. The 
exact method of converting the neutron data 


18 Milligan, Levy, and Peterson, Phys. Rev. 83, 226 
(1951). 

19 Warren, Krutter, and Morningstar, J. Am. Ceram. 
Soc. 19, 202 (1936). 


into a curve showing the radial distribution of 
atomic density surrounding each atom has not 
been completely established, but preliminary 
calculations show that the additional coherent 
effects on the neutron diffraction pattern pro- 
vide a radial distribution curve with much better 
resolution than is found in a similar curve ob- 
tained from x-ray data. 


Magnetic Scattering Phenomena 


One of the most important applications of 
neutron diffraction is the study of magnetic 
scattering phenomena. In problems of this type, 
the data are unique, since neither x-ray nor 
electron diffraction can furnish similar informa- 
tion. The neutron experiences a magnetic inter- 
action with any localized atomic magnetic mo- 
ments in the substance, and the diffraction 
pattern is representative of the magnetic phase 
in which the material exists. 

In a paramagnetic substance, the atomic mag- 
netic moments are randomly oriented, and since 
there is no periodicity of the alignment of mag- 
netic moments within the lattice, the scattering 
will appear as incoherent or diffuse scattering. 





Fig. 7. Neutron diffraction patterns of MnO powder taken 
at room temperature and liquid nitrogen temperature. 
[Axes: intensity (neutrons/min) versus scattering angle.] 


Like all neutron magnetic scattering, paramag- 
netic scattering has a form factor dependence 
so that the diffuse scattering diminishes for 
larger values of the scattering angle. The theory 
of neutron scattering by a true paramagnetic 
substance has been investigated by Halpern 
and Johnson,²⁰ and for a bound atom or ion 
whose magnetic moment is due entirely to an 
electronic spin S, the differential cross section 
(cross section per unit solid angle) for magnetic 
scattering per atom is given by 


\[ d\sigma_{pm} = \tfrac{2}{3}\, S(S+1) \left(\frac{e^2\gamma}{mc^2}\right)^2 f^2 \, d\Omega, \tag{3} \]


where f is the amplitude form factor, γ is the 
neutron magnetic moment expressed in nuclear 
Bohr magnetons, and the other terms have their 
conventional significance. 


Fig. 8. Antiferromagnetic structure existent in 
MnO below 120°K. [Diagram: Mn atoms in MnO, with the 
magnetic unit cell indicated.] 


Fig. 9. Comparison of the theoretical and experimental 
distributions of electron density in the 3d shell 
of Mn⁺⁺. [Panel title: "3d" electron distribution in Mn⁺⁺. 
Axes: u(r)dr, number of electrons in dr, versus radius (Å).] 


20 O. Halpern and M. H. Johnson, Phys. Rev. 55, 898 
(1939). 

When the temperature of many paramagnetic 
substances is decreased, a transition temperature 
is passed where the magnetic moments become 
locked into parallel or antiparallel orientation 
by strong quantum-mechanical exchange forces. 
Such substances, called ferromagnetic and anti- 
ferromagnetic, respectively, have lattices in 
which there is a definite repetition distance with 
respect to the directions of the magnetic mo- 
ments, and additional reflections are seen on the 
diffraction pattern. For ferromagnetic sub- 
stances, the magnetic reflections occur at the 
same positions as the nuclear reflections, while 
antiferromagnetic reflections will be found at 
new positions where there may not be any 
nuclear coherent scattering. Halpern and Johnson²⁰ 
have also given a theoretical treatment of 
neutron scattering from these oriented magnetic 
lattices for those cases where the magnetic 
properties of the atom or ion are due to the elec- 
tronic spins. The general expression for the dif- 
ferential scattering cross section of a magnetic 
ion has been found to be 


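\[ F^2 = b^2 + 2bp\,\mathbf{q}\cdot\boldsymbol{\lambda} + p^2 q^2, \tag{4} \]


where b is the nuclear scattering amplitude, p 
the magnetic scattering amplitude, λ a unit 
vector describing the polarization state of the 
neutron being scattered, and 


\[ \mathbf{q} = \mathbf{e}(\mathbf{e}\cdot\boldsymbol{\kappa}) - \boldsymbol{\kappa}, \tag{5} \]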







so that 


\[ q^2 = 1 - (\mathbf{e}\cdot\boldsymbol{\kappa})^2. \tag{6} \]


In these equations e is the unit scattering vector 
given by (k − k′)/|k − k′|, where k and k′ are the 
incident and scattered wave vectors, and κ is a 
unit vector parallel to the magnetic moment of 
the magnetic ion. The magnetic scattering amplitude 
p is given by 


\[ p = \left(\frac{e^2\gamma}{mc^2}\right) S f, \tag{7} \]


where the terms are the same as those in Eq. (3). 
It should be emphasized that the square of p in 
Eq. (4) involves a numerical square of S rather 
than the quantum-mechanical square of S which 
appeared in Eq. (3). If the incident neutron 
radiation is unpolarized, the average value of 
q·λ is zero, and the differential cross section is 
the sum of the nuclear and magnetic con- 
tributions 


\[ F^2 = b^2 + p^2 q^2. \tag{8} \]


The value of q², as shown in Eq. (6), depends 
upon the relative orientations of the scattering 
vector and the magnetization vector, and for 
the particular case where κ is randomly directed, 
q² has a value of 2/3. 
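
As a rough numerical illustration of Eqs. (3), (6), and (7), the sketch below evaluates the magnetic scattering amplitude and the paramagnetic cross section for a spin of 5/2 in the forward direction (f = 1), and checks the average of q² for randomly directed moments. The numerical constants (the classical electron radius e²/mc² and the neutron moment in nuclear magnetons) are standard values assumed here and are not quoted in the article; the function names are illustrative only.

```python
# Assumed standard constants (not quoted in the article).
CLASSICAL_ELECTRON_RADIUS_CM = 2.818e-13   # e^2 / (m c^2)
NEUTRON_MOMENT_NM = 1.913                  # gamma, in nuclear magnetons (magnitude)


def magnetic_amplitude(spin, form_factor=1.0):
    """Eq. (7): p = (e^2 gamma / m c^2) S f, in cm."""
    return CLASSICAL_ELECTRON_RADIUS_CM * NEUTRON_MOMENT_NM * spin * form_factor


def paramagnetic_cross_section(spin, form_factor=1.0):
    """Eq. (3): (2/3) S(S+1) (e^2 gamma / m c^2)^2 f^2, in cm^2 per steradian."""
    unit = CLASSICAL_ELECTRON_RADIUS_CM * NEUTRON_MOMENT_NM
    return (2.0 / 3.0) * spin * (spin + 1.0) * unit ** 2 * form_factor ** 2


def mean_q_squared(n=100000):
    """Average of Eq. (6), 1 - (e.kappa)^2, over random moment directions.

    For directions uniform on a sphere, u = e.kappa is uniform on [-1, 1];
    the average is taken with a simple midpoint rule.
    """
    du = 2.0 / n
    total = sum(1.0 - (-1.0 + (i + 0.5) * du) ** 2 for i in range(n))
    return total / n


p_mn = magnetic_amplitude(2.5)                  # about 1.35e-12 cm
sigma_pm_mn = paramagnetic_cross_section(2.5)   # about 1.7e-24 cm^2 per steradian
q2_random = mean_q_squared()                    # close to 2/3
```

The amplitude obtained for S = 5/2 is of the same order as typical nuclear scattering amplitudes, in line with the remark that magnetic and nuclear intensities are comparable for the manganese ion. Comparing p²q² with q² = 2/3 against Eq. (3) also shows the S² versus S(S+1) distinction emphasized in the text.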




Neutron diffraction patterns of MnO obtained 
by Shull, Strauser, and Wollan²¹ are 
characteristic of those for a material which passes 
from a paramagnetic phase to an antiferromag- 
netic phase as the temperature is lowered. The 
transition temperature of MnO is 120°K, and 
Fig. 7 shows the diffraction patterns of a 
powdered sample of MnO taken at 293°K and 
80°K. The room temperature pattern contains 
both paramagnetic diffuse scattering and the 
nuclear diffraction peaks, while at the low tem- 
perature, the magnetic scattering is concentrated 
in additional reflections. Since the divalent 
manganese ion, Mn⁺⁺, is in a spectroscopic S 
state with five electrons in the 3d shell whose 
spins are aligned parallel, the spin quantum 
number is 5/2, and the intensity of the magnetic 
scattering is comparable to that of the nuclear 
scattering. The extra magnetic reflections cannot 
be indexed on the basis of the conventional unit 
cell, but require a unit cell which has an axial 
length twice as long as the chemical cell. The 
magnetic structure of MnO, which was obtained 
from a detailed study of the reflections, is por- 
trayed in Fig. 8, and shows the reason why the 
repetition distance for identical scattering power 


Fig. 10. Neutron diffraction pattern of polycrystalline Fe at room temperature. 
[Axes: intensity (neutrons/min) versus scattering angle.] 


21 Shull, Strauser, and Wollan, Phys. Rev. 83, 333 (1951). 


is twice as long for magnetic scattering. In the 
diffraction patterns shown in Fig. 7, the intensi- 
ties of both the diffuse paramagnetic scattering 
at room temperature and the magnetic reflec- 
tions at 80°K fall off with a form factor de- 
pendence, and the exact angular dependence of 
the form factor for Mn⁺⁺ can be determined 
directly from the patterns. Actually, the room 
temperature pattern of MnO is not representa- 
tive of a truly paramagnetic substance. The 
broad maximum which occurs at about 11 de- 
grees suggests that at room temperature, the 
magnetic moments in MnO are not oriented 
completely at random, but that there exists a 
short-range magnetic order which gives rise to 
residual coherence somewhat analogous to the 
scattering by liquids. However, from data taken 
on several paramagnetic salts containing Mn⁺⁺ 
ions at various temperatures,²¹ the form factor 
for the Mn⁺⁺ ion was obtained. Since this form 
factor is representative of only those electrons 
contributing to the atomic magnetic moment, 
it was used to obtain the radial distribution 
curve of the 3d shell within the atom. This ex- 
perimental curve showing the radial distribution 
of electron density in the 3d shell of Mn⁺⁺ is 
shown in Fig. 9 with a theoretical curve which 
Dancoff calculated by a self-consistent field 
analysis. 

Magnetic scattering by ferromagnetic ma- 
terials can be studied with either magnetized or 
unmagnetized samples. In both cases the mag- 
netic moments of the atoms within a domain are 


Fig. 11. Angular variation of the differential scattering 
cross section per Fe atom. [Experimental points for 
polycrystalline Fe compared with curves calculated from the 
Steinberger-Wick form factors. Ordinate: differential 
scattering cross section (cm²/steradian).] 


in parallel alignment so that there are magnetic 
reflections which occur at the same positions as 
the nuclear reflections. The application of an 
external magnetic field to the sample is merely 
a method by which the amount of magnetic 
scattering can be controlled. 

The "unmagnetized" method will be considered 
first, and Fig. 10 shows the neutron 
diffraction pattern obtained for an unmagnetized 
sample of polycrystalline iron.²² The observed 
reflections have both magnetic and nuclear 
contributions, and these can be separated by 
converting the intensities in the reflections to 
differential scattering cross sections as shown in 
Fig. 11. Since magnetic scattering exhibits a 
form factor dependence, the angularly dependent 
part is the magnetic contribution, while the 
isotropic portion is nuclear scattering. The dotted 
lines through the plotted points are an indication 
of the agreement between experiment and theory. 
These lines were calculated using a magnetic 
moment per iron atom of 2.22 Bohr magnetons 
as determined from saturation magnetization 
experiments together with the form factors 
determined theoretically by Steinberger and 
Wick.²³ The Steinberger-Wick calculations give 
two slightly different form factors depending on 
the assumptions of the asymptotic behavior 
of the 3d-wave functions, and the experimental 
data are not considered sufficiently accurate to 
distinguish between the two. 

Since it is frequently difficult to make an 
accurate determination of the isotropic and 
angularly dependent parts of the scattering, the 
magnetic scattering from a ferromagnetic ma- 
terial can usually be determined more readily 
from investigations on magnetized samples. The 
magnetic contribution to the observed intensity 
depends on the relative orientations of the unit 
scattering vector and the atomic magnetic 
moment vector according to Eq. (6) and Eq. (8). 
Since the direction of the atomic magnetic 
moment in a ferromagnetic material can be con- 
trolled by the application of a sufficiently high 
external magnetic field, the amount of magnetic 
scattering will change as the direction of the 
external field is varied. From a comparison of 

22 Shull, Wollan, and Koehler, Phys. Rev. 84, 912 (1951). 


23 J. Steinberger and G. C. Wick, Phys. Rev. 76, 994 
(1949). 














the observed change with that calculated from 
theory, the magnitude of the magnetic scattering 
contribution can be determined. The intensity 
effects which arise with magnetized samples of 
iron are quite small, since, as shown in Fig. 11, 
the magnetic contribution to the scattering cross 
section is small compared to the nuclear portion. 
A better example is found in the (111) reflection 
of Fe₃O₄ (magnetite). Magnetite possesses a 
spinel-type structure and is ferromagnetic to 
about the same extent as nickel. The (111) 
Fe₃O₄ reflection is almost entirely magnetic, 
and Fig. 12 shows neutron data taken for a 
polycrystalline sample under three different conditions.²² 
For the situation in which the external 
field aligned the atomic magnetic moments 
parallel to the scattering vector, the value of q² 
in Eq. (8) was zero, and there was no magnetic 
scattering. When the direction of the atomic 
magnetic moments was perpendicular to the 
scattering vector, q² was equal to unity, and the 
intensity of magnetic scattering was greater 
than that obtained for the unmagnetized sample 
when q² was 2/3. 
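
The three cases just described follow directly from Eq. (6). The short sketch below, with illustrative function and variable names that are not part of the original work, evaluates q² for moments parallel and perpendicular to the scattering vector and compares the perpendicular case with the random-orientation value of 2/3. It assumes both vectors are already of unit length and that the reflection is purely magnetic.

```python
def q_squared(e, kappa):
    """Eq. (6): q^2 = 1 - (e . kappa)^2 for unit vectors e and kappa."""
    dot = sum(a * b for a, b in zip(e, kappa))
    return 1.0 - dot ** 2


scattering_vector = (0.0, 0.0, 1.0)

q2_parallel = q_squared(scattering_vector, (0.0, 0.0, 1.0))       # field along e
q2_perpendicular = q_squared(scattering_vector, (1.0, 0.0, 0.0))  # field normal to e
q2_unmagnetized = 2.0 / 3.0                                       # random moment directions

# For a purely magnetic reflection the intensity is proportional to q^2, so the
# perpendicular field should raise the peak above the unmagnetized value by this factor.
enhancement = q2_perpendicular / q2_unmagnetized
```

Under these assumptions the magnetic intensity vanishes for the parallel field and rises by a factor of 3/2 over the unmagnetized value for the perpendicular field.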

There are many materials whose magnetic 
properties are fundamental to a better under- 
standing of solid-state physics, and some of 
these materials have been investigated with the 
neutron diffraction techniques that have been 
described. Considerable interest is found in the 
magnetic behavior of the ferrites, and in addition 
to the work on magnetite which was mentioned 
previously, detailed investigations have been 
performed on other ferrites to determine both 
the chemical and magnetic structures.²⁴ A study 
was recently performed on the transition metals²⁵ 
to determine any magnetic structures existent 
in the nonferromagnetic elements of this group, 
and both chromium and α-manganese were 
found to be slightly antiferromagnetic with 
transition temperatures of 475°K and 100°K, 
respectively. Investigations have also been made 
on ferromagnetic binary alloys of the transition 
elements²⁶ to determine the individual atomic 


24 J. M. Hastings and L. M. Corliss, Revs. Modern 
Phys. 25, 114 (1953); G. E. Bacon and F. F. Roberts, 
Acta Cryst. 6, 57 (1953); Corliss, Hastings, and Brockman, 
Phys. Rev. 90, 1013 (1953). 

25 C. G. Shull and M. K. Wilkinson, Revs. Modern 
Phys. 25, 100 (1953). 


26 M. K. Wilkinson and C. G. Shull, Phys. Rev. 90, 374 
(1953). 


Fig. 12. Intensity variation of the (111) Fe₃O₄ neutron 
diffraction reflection with different directions of 
magnetization. [Axes: intensity versus scattering angle; 
one curve is labeled unmagnetized.] 


magnetic moments associated with each element 
in the alloy, and recent studies of the paramagnetic 
scattering from rare earth oxides²⁷ 
have given information on the electronic dis- 
tribution in the 4f shell. 


V. PHYSICAL STRUCTURE PROBLEMS 


The application of neutron diffraction tech- 
niques to problems involving the physical struc- 
ture of matter also arises because of certain 
favorable properties of the neutron radiation. 
In these problems the most important considera- 
tions are the transparency of most materials to 
neutrons and the ease with which studies with 
long-wavelength radiation can be accomplished. 
Although this field of neutron diffraction has 
not been exploited as much as those previously 
mentioned, some very interesting investigations 
have been performed. 

Neutron diffraction studies of cold-worked 
brass have been reported recently by Weiss, 
Clark, Corliss, and Hastings.²⁸ These investigations 
were made in an attempt to settle certain 
points of disagreement which had arisen in 
similar x-ray studies. The neutron results indicated 
that in the absence of extinction effects, 
the integrated intensities from powder samples 
of cold-worked and annealed brass were identical. 
Furthermore, the diffuse scattering in a pattern 
of highly cold-worked brass differed from 
that of the annealed sample by less than 0.4 
percent. 


27 W. C. Koehler and E. O. Wollan, Phys. Rev. (to be 
published). 


28 Weiss, Clark, Corliss, and Hastings, J. Appl. Phys. 
23, 1379 (1952). 


Fig. 13. Intensity variation around the (110) Debye-Scherrer 
ring for two Fe blocks. 

Weiss has also made investigations on the 
small-angle scattering of neutrons.²⁹ The broadening 
of a neutron beam in traversing finely 
divided materials was measured as a function 
of the neutron wavelength, particle size, and 
coherent scattering cross section of the material. 
The experimental results were in good agree- 
ment with the scattering theory which predicted 
different effects for the two cases where the 
difference between the phase change in traversing 


29 R. J. Weiss, Phys. Rev. 83, 379 (1951). 


the particle diameter and the phase change in 
traversing the same length in vacuum was either 
large or small. 

The study of crystallite orientation effects in 
large samples of material is a problem to which 
neutron diffraction techniques are readily ap- 
plied. Figure 13 shows the intensity distribution 
around the (110) Debye-Scherrer ring produced 
with two samples of polycrystalline iron” which 
had a thickness of one centimeter. The dashed 
line is for a sample of cold-rolled steel (grain 
size about 25μ) and the dotted line is for an 
Armco block (grain size about 50μ). In contrast 
to a similar study with x-rays, the data are 
representative of the bulk of the material and 
not merely a surface layer. 


VI. CONCLUSION 


There are many other neutron diffraction in- 
vestigations which could have been mentioned 
in this article, but the illustrations that have 
been given are fairly representative of the types 
of problems which have been studied. Although 
the field of neutron diffraction is relatively new, 
valuable information has been obtained both in 
studies which supplemented investigations by 
x-ray and electron diffraction and in studies 
where other types of research were impossible. 
Rapid progress in neutron diffraction 
has been seriously handicapped by the limited 
facilities available, but these facilities have been 
increased appreciably in recent years, and future 
researches will undoubtedly uncover many problems 
as interesting as those which have 
been described. 


June Meeting of the AAPT 


The June Meeting of the AAPT will be held Monday, 
Tuesday, and Wednesday, June 28-30, at the University 
of Minnesota, Minneapolis, Minnesota. President-elect 
R. RONALD PALMER, Beloit College, is program chairman. 
The American Physical Society is holding a meeting concurrently. 
A special feature of this meeting (reserved for 
early risers) is a total eclipse of the sun, for which the 
duration of the total phase will be 1.2 minutes. Headquarters 
for the meeting will be the Nicollet Hotel. 





Photonuclear Reactions 


ARTHUR I. BERMAN 
High Energy Physics Laboratory and Department of Physics, Stanford University, Stanford, California 
(Received August 21, 1953) 


The significant aspects of photonuclear processes are reviewed in terms of the trends in re- 
search since their discovery in 1934. Experimental techniques with high-energy x-radiation, 
which are currently employed in these investigations, are described. 


I. SUMMARY OF PRESENT KNOWLEDGE 
A. Introduction 


THE excitation and transformation of nuclei 
by the impact of photons forms an im- 
portant chapter in the study of nuclear phe- 
nomena. An understanding of the mechanism of 
these processes, unique in that electromagnetic 
rather than nuclear forces are involved, provides 
a valuable clue to the fundamental nature of the 
nucleus. 

Numerous types of reactions take place rang- 
ing from the re-emission of the excitation energy 
in the form of one or more quanta to the violent 
disruption of a metastable heavy nucleus into 
two or more fission fragments. Most frequently, 
however, photoexcited nuclei regain their sta- 
bility in the emission of a single neutron. The 
total cross section vs quantum energy function 
for all photoinduced reactions in a given nucleus 
is unusual in that it displays a pronounced 
resonance. For all elements this resonance is 
about 6 Mev in width and the peak energy lies 
in the 15-20 Mev range; this peak energy slowly 
decreases with atomic number while the cross 
section integrated over all energies increases. 
The approximate magnitude and shape of the 
cross section as a function of energy for a typical 
element, copper, is shown in Fig. 1, together with 
cross sections for the usual atomic interactions 
for comparison. 


B. History of the Experimental Programs 
1. Radioisotope Sources 


In 1934, the first reaction induced by quanta 
was attempted by Chadwick and Goldhaber,¹ 
who sought to disintegrate the simplest of all 


1 J. Chadwick and M. Goldhaber, Nature 134, 237 
(1934). 


nuclear systems, the deuteron, using 2.62-Mev 
gamma rays from ThC″. Their success, marked 
by the detection of the recoil protons and meas- 
urement of their energy, inaugurated a series of 
experiments with every available element which 
continues through the present in high-energy 
laboratories throughout the world. 

Immediately afterward, Szilard and Chalmers 
sought to detect neutrons, which they predicted, 
quite correctly, would be the most probable 
emanation from more complex excited nuclei.² 
This they did using radium gamma rays in the 
first photodisintegration of beryllium. 

The theory of the deuteron photodisintegration 
by Bethe and Peierls³ marked one of the 
notable triumphs of quantum mechanics in 
nuclear theory. It agreed with the measured 
threshold for the reaction, 2.17 Mev,⁴ as well as 
the observed angular distribution. The existence 
of a gradually decreasing tail in the cross section 
vs energy curve was predicted. For beryllium, 
Guth and Mullin derived a theory of low-energy 


Fig. 1. Photon interaction cross sections in copper. 
[Axes: cross section (barns) and linear absorption 
coefficient (cm⁻¹) versus ε in Mev.] 


2 L. Szilard and T. Chalmers, Nature 134, 494 (1934). 

3 H. Bethe and R. Peierls, Proc. Phys. Soc. (London) 
A148, 146 (1935). 

4 A more recent value from several experiments is 2.22 
Mev, as discussed by E. E. Salpeter, Phys. Rev. 82, 60 
(1951). 


photodisintegration.⁵ This is treated essentially 
as a two-body problem as the neutron is removed, 
by direct photon interaction, from a tightly 
bound Be⁸ nucleus. These processes resemble the 
atomic photoeffect in the ejection of an orbital 
electron by an incident quantum, especially with 
respect to the angular distribution which is of 
the form 


\[ n\,d\theta = (a + b\sin^2\theta)\,d\theta, \tag{1} \]


where a and b are constants. 


2. Gamma Rays from Nuclear Reactions 


In the breakup of the deuterium and beryllium 
nuclei, gamma radiations from the natural radio- 
active elements exhausted their potentialities 
because of the limited energy available per quantum. 
In 1937, however, Bothe and Gentner utilized 
the artificially produced 17.6-Mev gamma 
ray of the Li⁷(p,γ)Be⁸ reaction to study the 
photodisintegration of a large number of elements.⁶ 
Their results were unexpected: some elements 
exhibited huge cross sections, others none 
at all. This remarkable selectivity observed at 
this single energy challenged the new liquid drop 
model of the nucleus which earlier had been 
described by Bohr in order to explain the nonselective 
character of high-energy neutron collisions.⁷ 
But Bohr was soon to point out that an 
analogy cannot easily be drawn between the 
inelastic impact of a massive high-energy neutron 
and that of an electromagnetic wave 
packet;⁸ in the latter case the nucleus may be 
set into well-ordered, though violent, vibration. 


3. The Early Betatrons 


Since reasonably intense nuclear gamma rays 
of energies above that of nearly all photonuclear 
thresholds are virtually nonexistent (the main 
exception being the lithium 17.6-Mev line), 
further data about the energy dependence of 
photonuclear cross sections were unavailable. In 
1941, however, Kerst successfully constructed a 
betatron, and by the late forties 25-Mev beta- 
trons in many parts of the world were employed 
in the study of photonuclear reactions. 


5 E. Guth and C. J. Mullin, Phys. Rev. 76, 234 (1949). 
6 W. Bothe and W. Gentner, Z. Physik 106, 236 (1937). 
7 N. Bohr, Nature 137, 344 (1936). 
8 N. Bohr, Nature 141, 326 (1938). 


The use of this machine involves a severe 
handicap in the evaluation of data. Unlike 
workers in the low-energy x-ray field, who make 
use of sharp-line atomic spectra, the 
high-energy experimenter has only the continuous 
bremsstrahlung available. The betatron is 
successively operated at several electron energies. 
At each operation, a foil of the element to be 
studied is inserted in the beam, and the number 
of disintegrations induced by the bremsstrahlung 
is recorded as a function of the maximum quantum 
energy. From these data and the number of 
photons in the beam, as well as the energy 
available per photon, the cross section vs energy 
function may be evaluated. 
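
The evaluation just described amounts to solving a set of equations, one for each betatron operating energy, in which the measured yield is the sum over photon energies of the number of photons times the cross section. The sketch below illustrates this step by step for a made-up case. It is not the procedure of any particular laboratory: the photon spectrum used (constant intensity up to the end point, so that the photon number varies as 1/E) is a deliberately crude stand-in for a real bremsstrahlung spectrum, and the resonance shape, energy grid, and function names are all hypothetical.

```python
def toy_spectrum(energy, end_point):
    """Hypothetical photon number spectrum: proportional to 1/E below the end point."""
    return 1.0 / energy if energy < end_point else 0.0


def yields_from_cross_section(sigma, energies, end_points, d_e):
    """Forward calculation: yield at each end point = sum of N(E, E_max) * sigma(E) * dE."""
    return [
        sum(toy_spectrum(e, e_max) * s * d_e for e, s in zip(energies, sigma))
        for e_max in end_points
    ]


def unfold(yields, energies, end_points, d_e):
    """Recover sigma bin by bin, starting from the lowest end point.

    Each new end point adds one new energy bin, so the contribution of the
    bins already solved is subtracted and the remainder assigned to the new bin.
    """
    sigma = []
    for i, (y, e_max) in enumerate(zip(yields, end_points)):
        known = sum(toy_spectrum(energies[j], e_max) * sigma[j] * d_e for j in range(i))
        new_bin_weight = toy_spectrum(energies[i], e_max) * d_e
        sigma.append((y - known) / new_bin_weight)
    return sigma


d_e = 1.0                                             # bin width, Mev
energies = [10.5 + i * d_e for i in range(15)]        # bin centers, Mev
end_points = [e + 0.5 * d_e for e in energies]        # maximum quantum energies, Mev

# Made-up resonance: peak at 17.5 Mev, full width about 6 Mev, arbitrary units.
true_sigma = [1.0 / (1.0 + ((e - 17.5) / 3.0) ** 2) for e in energies]

measured = yields_from_cross_section(true_sigma, energies, end_points, d_e)
recovered = unfold(measured, energies, end_points, d_e)
```

In this idealized case the recovered values reproduce the assumed resonance exactly. With real data, each step subtracts nearly equal numbers, so statistical errors in the yields are strongly magnified in the cross section, which is one aspect of the handicap mentioned above.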

It was in this manner that Baldwin and 
Klaiber, in 1948, obtained for the first time 
cross-section curves exhibiting the characteristic 
resonant shape.⁹ Although they measured only the 
C¹²(γ,n)C¹¹ and Cu⁶³(γ,n)Cu⁶² reactions, this 
resonance appeared to be a general property of 
photonuclear reactions, for Perlman and Friedlander,¹⁰ 
in preceding experiments, measured induced 
radioactivity from fifteen elements from 
C¹² to Re¹⁸⁷, using 50- and 100-Mev bremsstrahlung, 
and discovered no apparent increase in 
activation at the higher energy. Present evidence 
indicates that competing reactions in 
which protons, or groups of nucleons, are emitted 
do not alter the general tendency of the cross 
section summed over all reactions in an isotope 
to exhibit a resonant shape, although there is a 
possibility for the existence of a slowly decreasing 
tail. 

In addition to cross-section measurements, 
considerable data have been collected on the 
distribution in energy and direction of emitted 
particles, and also on the threshold energy for 
the reactions. 


4. The Later High-Energy Electron Accelerators 


At present, several betatrons, electron syn- 
chrotrons, and linear electron accelerators oper- 
ate at energies up to about 400 Mev. In experi- 
ments with bremsstrahlung in this range, an 
increase in the photonuclear cross section from 

9 G. C. Baldwin and G. S. Klaiber, Phys. Rev. 73, 1156 
(1948). 


10 M. L. Perlman and G. Friedlander, Phys. Rev. 72, 
1272 (1947). 


about 150 Mev onward was observed, of the 
same order of magnitude as the cross section in 
the resonant region.¹¹ There is speculation that 
the increase represents an interaction between 
the photon field and meson field of the nuclei, 
since this is the energy region for the appearance 
of the π meson. 

At present, the most important aspect of 
these high-energy experiments is the study of 
nuclear structure with respect to the groupings 
of the nucleons into subunits. A colliding photon 
of extreme relativistic energy is characterized 
more than ever as a particle and can produce 
pronounced direct interactions with nuclear sub- 
units, revealing much about their nature within 
the nucleus and their binding energies. 


C. Uses of Photonuclear Reactions 


1. Determination of Mass and Binding Energy 


The initial photonuclear reaction studied, 
H²(γ,n)H¹, provided, for the first time, an 
accurate determination of the mass of the neutron. 
Chadwick and Goldhaber estimated the 
energy of the recoil protons in their deuterium-
filled ion chamber as 0.25 Mev. Then, from the 
known energy of the ThC″ gamma ray, 2.62 
Mev, together with the known deuteron and 
proton masses, they calculated the deuteron 
binding energy as 2.1 Mev and the neutron mass 
as 1.0081 amu, which are close to the most 
recent values. The photodisintegration of heavier 
nuclei reveals, in the same manner, the binding 
energy of the emitted particle as well as the mass 
difference of the neighboring isotopes. To avoid 
the necessity of measuring the kinetic energy of 
the emitted particle, the energy of the photon is 
lowered to the value at which the reaction barely 
proceeds and the particle is then emitted with 
practically zero energy. 
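
The arithmetic behind the Chadwick and Goldhaber estimate can be written out in a few lines. The sketch below assumes, as a simplification not spelled out in the text, that the neutron and proton share the available energy equally because their masses are nearly equal and the photon momentum is small. The function names and the conversion factor of roughly 931 Mev per atomic mass unit are assumptions made for illustration.

```python
MEV_PER_AMU = 931.0   # approximate conversion factor, assumed here


def deuteron_binding_energy(photon_mev, proton_kinetic_mev):
    """Binding energy = photon energy minus the kinetic energy of both fragments.

    The neutron is assumed to carry the same kinetic energy as the proton.
    """
    return photon_mev - 2.0 * proton_kinetic_mev


def neutron_mass_amu(deuteron_mass_amu, proton_mass_amu, binding_mev):
    """Neutron mass from the deuteron and proton masses and the binding energy."""
    return deuteron_mass_amu - proton_mass_amu + binding_mev / MEV_PER_AMU


# Numbers quoted in the text: 2.62-Mev gamma ray, 0.25-Mev recoil protons.
binding = deuteron_binding_energy(2.62, 0.25)
```

With the quoted figures this gives a binding energy of about 2.1 Mev, as stated. The threshold method described above removes the need for the kinetic energy term altogether, since the fragments then emerge with practically zero energy.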


2. Energy Calibration of Electron Accelerators 


If the characteristics of a high-energy ac- 
celerator are such as to establish a known rela- 
tionship between the various operating energies, 
then a knowledge of a single operating energy 
will provide a means for calibrating the ac- 
celerator. To illustrate this (see Fig. 2), suppose 


11 L. W. Jones and K. M. Terwilliger, U. S. Atomic 
Energy Commission Document UCRL-1618 (1951). 


Fig. 2. A method for calibrating a betatron Mev meter. 
The current through the energizing coils a is controlled 
at panel b. The induced voltage c is proportional to the 
magnetic field at the electron orbit at the instant of x-ray 
target impact d. The bremsstrahlung activates the carbon e. 
The current in a is lowered until the carbon activation 
ceases. At this point the meter reading corresponds to a 
maximum quantum energy of 18.7 Mev, the threshold 
for the C¹²(γ,n)C¹¹ reaction. 


the energy of a 25-Mev betatron is controlled by 
adjusting the effective value of the alternating 
magnetic field strength of the betatron magnet, 
as is sometimes, though not usually, done. This 
value is proportional to the magnitude of the 
field in the orbit of the electrons at the instant 
they strike the target (assuming they are always 
permitted to strike when the alternating field is 
at the same point in its cycle). An ac voltmeter, 
coupled to the changing field, will record a de- 
flection which also is proportional to that 
instantaneous field. Furthermore, since the mag- 
netic field is in turn proportional to the electron 
momentum in the constant radius orbit, one can 
equate the voltmeter reading to the momentum. 
Thus, the relativistic kinetic energy equation for 
the electron 


\[ T = \left[(cp)^2 + (m_0c^2)^2\right]^{1/2} - m_0c^2, \tag{2} \]


where T = electron kinetic energy, p = electron 
momentum, m₀ = electron rest mass, and c 
= velocity of light, is transformed into the betatron 
calibration equation 


\[ h\nu_{\max} = \left[(KD)^2 + (m_0c^2)^2\right]^{1/2} - m_0c^2, \tag{3} \]


where $h\nu_{\max}$ = maximum quantum energy of 
bremsstrahlung, D = voltmeter deflection, and 
K = calibration constant. 

Suppose the threshold of a particular reaction, 
e.g., C¹²(γ,n)C¹¹, is known accurately from mass 
spectroscopic measurements and beta energy 
data as 18.7 Mev. A carbon sample is then placed 
in the betatron x-ray beam and the resultant 
20-minute carbon-11 activity is recorded. The 
magnetic field is then lowered to an effective 
value at which the activity decreases to zero so 
that $h\nu_{\max}$ now equals 18.7 Mev. The calibration 







constant is then found in terms of the voltmeter 
reading for this energy from Eq. (3). 
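
The calibration procedure can be sketched numerically as follows. The voltmeter deflection used in the example is an invented number, the electron rest energy of 0.511 Mev is a standard value assumed here, and the function names are illustrative only.

```python
import math

ELECTRON_REST_ENERGY_MEV = 0.511   # m0 c^2, standard value assumed here


def calibration_constant(threshold_mev, deflection_at_threshold):
    """Solve Eq. (3) for K, given the deflection at which the activity just vanishes."""
    total_energy = threshold_mev + ELECTRON_REST_ENERGY_MEV
    cp = math.sqrt(total_energy ** 2 - ELECTRON_REST_ENERGY_MEV ** 2)
    return cp / deflection_at_threshold


def max_quantum_energy(k, deflection):
    """Eq. (3): maximum bremsstrahlung quantum energy for a given deflection."""
    cp = k * deflection
    return math.sqrt(cp ** 2 + ELECTRON_REST_ENERGY_MEV ** 2) - ELECTRON_REST_ENERGY_MEV


# Hypothetical reading: the carbon-11 activity disappears at a deflection of 74.8 divisions.
K = calibration_constant(18.7, 74.8)
energy_at_full_scale = max_quantum_energy(K, 100.0)   # energy at 100 divisions
```

Because the rest energy is small compared with the energies involved, the resulting scale is very nearly linear in the deflection, but the exact relation costs nothing to retain.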


3. Source of Neutrons 


Electron accelerators provide a copious source 
of neutrons with a wide energy spectrum by 
photonuclear reactions produced either in the 
x-ray target of the machine or in a region 
under x-ray exposure. This technique may pro- 
duce neutrons in regions accessible only to highly 
penetrating x-rays. Photoneutrons may also be 
produced by fabricating a beryllium block around 
a strong gamma emitter. The Be⁹(γ,n)Be⁸ reaction 
provides essentially monokinetic neutrons 
of energy (8/9)(hν − ε_T), ε_T being 1.63 Mev, the 
reaction threshold. 
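
The neutron energy from such a source follows from sharing the excess energy between the neutron and the residual nucleus in inverse proportion to their masses. The sketch below uses the threshold quoted in the text and neglects the photon momentum. The function name is illustrative, and the choice of the 2.62-Mev gamma ray as the example is made here only because that energy appears earlier in the article.

```python
BE9_THRESHOLD_MEV = 1.63   # threshold quoted in the text


def photoneutron_energy(photon_mev, threshold_mev=BE9_THRESHOLD_MEV, residual_mass_number=8):
    """Neutron energy = A/(A+1) * (h nu - threshold), with A the residual mass number.

    The photon momentum is neglected, so the neutron and the residual
    nucleus recoil with equal and opposite momenta.
    """
    if photon_mev <= threshold_mev:
        raise ValueError("photon energy is below the reaction threshold")
    share = residual_mass_number / (residual_mass_number + 1.0)
    return share * (photon_mev - threshold_mev)


# Example: beryllium irradiated with a 2.62-Mev gamma ray.
neutron_mev = photoneutron_energy(2.62)
```

For this example the neutrons emerge with about 0.88 Mev, the remaining ninth of the excess energy going to the recoiling residual nucleus.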


4. Threshold Detectors 


A convenient way to isolate a part of the 
bremsstrahlung in order to work with a quasi- 
monochromatic high-energy and high-intensity 
photon source is through the use of the sharp 
low-energy cutoff property of photonuclear re- 
actions. The accelerator is operated at an energy 


slightly above the threshold energy of the reaction. 
When the bremsstrahlung energy density, 
J (the number of photons per quantum 
energy range ε to ε+dε times the energy per 
photon), is multiplied by the cross section σ, 
an effective intensity distribution in a 
narrow energy band, as shown in Fig. 3, is produced. 
One may measure, for example, the 
photon absorption coefficients through various 
absorbers in this energy region. “Good geometry”
will prevail without the need for collimators. 





[Figure 3: effective intensity plotted against quantum energy ε.]

Fig. 3. Limiting the effective intensity of bremsstrahlung
to a narrow energy band by detecting the
radiation with a photonuclear reaction.




5. Knowledge of the Nucleus 


If one were to state the most fundamental 
value of the study of photonuclear reactions, it 
would be the broadening of our knowledge of 
nuclear structure and dynamics in the interpre- 
tation of the hundreds of experiments that have 
taken place during the past two decades. 


D. Mechanism of the Reaction 
1. The Collective Model of the Nucleus 


In the study of atomic phenomena it is cus- 
tomary to analyze interactions in terms of 
specific electrons. The noninteracting electrons 
merely contribute to the average field through 
which the electron, upon which the excitation 
energy is concentrated, moves. Likewise, in 
light nuclei, the individual nucleons may be 
considered to move through the relatively large 
volume of the nucleus perturbed only slightly 
by the fields of their neighbors; in analogy with 
the atom, a closed-shell model of the nucleus 
may be given. However, in the case of complex 
nuclei, an alternate model may be described 
whereby a configuration in which a given amount 
of energy is concentrated on any particular 
nucleon is unlikely. According to this theory, 
what had been the occasional encounter between 
nucleons now becomes the normal description of 
nuclear behavior; and, in the continual rapid 
transfer of energy from particle to particle, the 
states of the individual nucleons lose their sig- 
nificance in favor of a compound nuclear state 
based on this collective motion. 

Like many statistical interpretations, this 
model arises out of the extreme complexity in- 
volved in shaping gross behavior from the 
detailed analysis of individual interactions. It 
thus avoids the many-body problem of quantum 
mechanics and resorts instead to probability 
laws; and, like any statistical interpretation, 
the validity increases with the number of in- 
dividual members which compose the integrated 
whole. 

In some respects an analogy can be drawn 
between the dynamics of this collective model, 
as described by Bohr, and that of a liquid drop. 
For example, the internal forces are short range 
and show saturation effects. The transfer of 
energy throughout the constituents is considered 


as a “thermal agitation” resulting in the eventual
evaporation of one or more particles. 


2. Formation of a Compound Nucleus 


The capture of fast neutrons offered the first 
experimental evidence for the unusual behavior 
of excited nuclei characterized by the collective 
model. Gamma rays were emitted by the ex- 
cited nucleus of a sharpness corresponding to a 
duration of the excited state of several orders of 
magnitude longer than the transit time of a 
neutron across a space equal to the nuclear 
diameter. Also, unlike the extreme resonance 
exhibited in the capture of slow neutrons in the 
region of well-defined energy levels, fast neutron 
capture shows no selective character. 

Bohr explained these facts by describing the 
excited state of the collective nucleus as that in 
which the captured neutron distributes its large 
energy almost immediately among the closely- 
packed nucleons as a consequence of their in- 
numerable collisions. The combination of the 
quantized motions of each nuclear particle in 
the confined space of the nucleus determines the 
eigenstate of the system. As the possibilities of 
combination rapidly multiply with excitation 
energy, the spacings between energy levels grow 
smaller, and the level widths, being finite, actu- 
ally overlap each other. The several simultane- 
ous modes of excitation soon disintegrate, and 
the nucleus ends its existence in the expulsion 
of one or more particles. From the statistical 
point of view, a portion of the incident energy, 
shared by the nucleons, may, as a random 
fluctuation, be localized on a surface nucleon in 
an amount sufficient to expel it. 

In the case of photon capture, the localized 
incident energy might be expected to rapidly 
distribute itself through the aggregate of colliding 
particles to form a compound nucleus in an 
analogous manner; however, an exact parallelism
with fast neutron capture cannot be assumed. 


3. Absorption Processes 


The resonance characterizing photonuclear 
absorption occurs at an energy region which, 
according to the compound nucleus theory, is 
not identified with sharp widely spaced levels. 
Goldhaber and Teller attempted to explain this
paradox by proposing, in 1948, a radical absorption
mechanism.¹² They assumed that near 20
Mev in every nucleus (the actual energy value
decreasing slowly with mass number), there
exists the first excited state of what they call
the “dipole vibration.” Incident photons of the
resonant energy excite a motion in which the 
protons separate en masse from the neutrons. 
The strong forces which exist between the neu- 
tron and proton fluid account for the high fre- 
quency involved in this excitation. Because of 
the complexity of motions associated with a 
highly-excited compound nucleus, one expects 
the resonance to have a breadth of several Mev 
as energy is transferred from the orderly dipole 
vibration into other modes of nuclear motion. 
The energy levels of the excited nucleus, still 
numerous and overlapping, would cluster around 
the energy state of the dipole vibration. Gold- 
haber and Teller calculated, with this model, 
both the resonance energy for photonuclear 
absorption and the cross section integrated over 
all energies, in approximate agreement with 
experiment. Alternatively, Steinwedel, Jensen, 
and Jensen derived cross section formulas using 
hydrodynamic methods with a proton-neutron 
interpenetrating fluid model.¹³

Levinger and Bethe avoided the necessity for 
a special model by calculating the photoelectric 
dipole transitions of the individual nucleons and 
summing over all nucleons taking account of the 
effect of the neutron-proton exchange forces.¹⁴
Their result for the photonuclear cross section, 
integrated over all photon energies, is about half 
again as large as that calculated by Goldhaber 
and Teller. The mean energy for photon absorp- 
tion was also calculated. A separate computation 
of electric quadrupole absorption led to the con- 
clusion that the large photonuclear resonance 
can be accounted for only by a dipole effect, i.e., 
a net displacement of center of charge with 
respect to center of mass within the nucleus. 

At photon energies below that of the large 
electric dipole resonance region, however, one 
may expect electric quadrupole and magnetic 


12 M. Goldhaber and E. Teller, Phys. Rev. 74, 1046
(1948).

13 Steinwedel, Jensen, and Jensen, Phys. Rev. 79, 1019
(1950).

14 J. S. Levinger and H. A. Bethe, Phys. Rev. 78, 115
(1950).





Fig. 4. Possible photonuclear reactions in a given nucleus
where cross sections for the emission of protons, alphas,
and deuterons are small enough to be omitted. At the
energies at which multiplicity occurs, $\epsilon_2$ and $\epsilon_3$, the rise
in each reaction partially accounts for the fall in the
preceding one. The total (dotted) is still of a resonant shape.


dipole absorption to be significant.¹⁵ The effect
could be interpreted as a redistribution of the
nuclear fluid, in the form perhaps of surface
waves on the liquid drop model, in contrast to
the more energetic action present in electric
dipole absorption where the powerful neutron-proton
binding forces come into play. Careful
cross-section measurements near the threshold
of several photonuclear reactions revealed the
existence of a maximum in the cross section curve
apart from the resonance maximum.¹⁶ (This is
suggested in Fig. 1; however, fine structure which
may be present in separate reactions is usually
indistinct in the cross section curve summed for
all photoinduced reactions in an element.)


4. Testing the Compound Nucleus 


Bohr’s conception that energetic reactions in 
medium and heavy weight nuclei proceed through 
the formation of a quasi-stable compound nu- 
cleus is a description of nuclear structure which 
lends itself well to experimental test. 

Of primary significance is the lack of correla- 
tion between the formation and disintegration 
of the compound nucleus. In the rapid distribu- 
tion of captured energy and its final chance 
localization, through a system which is loosely 
coupled, the phase relationship between the 
incident and outgoing particle is essentially 
destroyed. Thus, the distribution of emitted 
particles should be isotropic. 

The energy distribution of these particles 
should follow the analogy with molecular evapo- 


15 J. M. Blatt and V. F. Weisskopf, Theoretical Nuclear
Physics (John Wiley and Sons, Inc., New York, 1952).

16 Horsley, Haslam, and Johns, Can. J. Phys. 30, 159
(1952); Stephens, Halpern, and Sher, Phys. Rev. 82, 511
(1951).



ration and indeed resemble a Maxwellian dis- 
tribution corresponding to a nuclear temperature 
which is a function of the excitation energy. 
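The evaporation picture can be illustrated with a short numerical sketch. It assumes the commonly used evaporation form $N(E) \propto E\,e^{-E/T}$ for the Maxwellian-like spectrum (the article does not write the formula out), with a hypothetical nuclear temperature $T$ in Mev; the function names are likewise illustrative.

```python
import math

def evaporation_spectrum(energy_mev, temperature_mev):
    """Normalized spectrum N(E) = (E / T^2) exp(-E / T); an assumed standard form."""
    return energy_mev / temperature_mev ** 2 * math.exp(-energy_mev / temperature_mev)

def mean_energy(temperature_mev, upper_mev=60.0, steps=6000):
    """Trapezoidal estimate of the mean emitted energy, integral of E * N(E) dE."""
    h = upper_mev / steps
    total = 0.0
    for i in range(steps + 1):
        e = i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * e * evaporation_spectrum(e, temperature_mev)
    return total * h

# For this assumed form the mean energy comes out close to twice the temperature.
print(mean_energy(1.0))
```

Under this assumption a higher excitation energy, and hence a higher temperature, shifts the whole spectrum upward while keeping low energies predominant.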

This temperature should, furthermore, determine
the multiplicity, or the number of particles
emitted per nucleus. In particular, the cross
section for the emission of a single neutron
should begin to decrease at the threshold energy
for the emission of two neutrons (see Fig. 4).

Finally, according to this collective, or statistical,
model, the evaporation of protons should
be practically insignificant in medium and
especially in heavy nuclei because of the diffi- 
culty of penetrating the large Coulomb potential 
barrier surrounding the nucleus. 

The first disagreement with this model was
obtained by Hirzel and Waffler who, by measuring
(γ,n) and (γ,p) cross sections for neighboring
isotopes, were able to estimate the ratio σ(γ,p)/σ(γ,n)
for light and medium weight nuclei.¹⁷
They found the ratio to fluctuate around unity
for light elements and show a general decrease
with Z from 0.05 at Z=30 to about 0.02 at
Z=50. These ratios, despite their small values,
still are 100 to 1000 times larger than those
calculated using the statistical model of Weisskopf
and Ewing.¹⁸ In an attempt to explain the
unusually large proton emission for medium
weight nuclei, later confirmed by betatron experiments,
Schiff modified the statistical theory by
emphasizing the distinction between a compound
nucleus formed by photon impact in contrast
to that formed by neutron impact.¹⁹ In the
former case, the compound nucleus is described 
by a relatively small number of modes of oscilla- 
tion as the electromagnetic wave interacts with 
each proton in a like manner in passing over the 
nucleus. Therefore, fewer energy states are 
available to the compound nucleus (or to the 
residual nucleus after neutron and proton emis- 
sion), especially at high energies, in comparison 
to a similar compound nucleus which would be 
formed by particle capture in the reverse process. 
Thus, relatively more high-energy particles are 
emitted, in order not to favor the formation of 
highly-excited residual nuclei. As a result, more 


17 O. Hirzel and H. Waffler, Helv. Phys. Acta 20, 374
(1947).

18 V. F. Weisskopf and D. H. Ewing, Phys. Rev. 57,
472 (1940).

19 L. I. Schiff, Phys. Rev. 73, 1311 (1948).





TABLE I. Descriptive models to describe photonuclear processes.

| | Compound nucleus: Ewing-Weisskopf | Compound nucleus: Schiff | Direct ejection: Courant | Experimental evidence |
|---|---|---|---|---|
| Total cross section | Resonance shape permitted. Decrease in σ(γ,jn) partially compensated by increase in σ(γ,(j+1)n). | Same as Ewing-Weisskopf. | Slow decrease at high energies | Several (γ,n) maxima coincide with (γ,2n) thresholds,ᵃ especially for Ta.ᵇ Long high-energy tail observed.ᶜ |
| σ(γ,p)/σ(γ,n) | ~10^(−?) to 10⁻⁵ for Z>30 | 10^(−?) to 10^(−?) for Z>30 | | ≈5×10⁻² to 10⁻² for Z>30.ᵈ ≈10^(−?) for Ag.ᵉ |
| Angular distribution of emitted particles | Isotropic | Essentially isotropic with possible anisotropy | $n_\theta\,d\theta = (a + b\sin^2\theta)\,d\theta$ | Fast n mainly at 90° from Pb, Bi, W.ᶠ Low-energy p isotropic, high-energy mainly at 90° from Ag,ᵉ Al,ᵍ Rh.ʰ |
| Energy distribution of emitted particles | Maxwellian; medium low energies predominate | Higher energies predominate | Very high energies predominate | p dist. Maxwellian with slight high-energy excess, from Ag,ᵉ Cu,ⁱ Mo,ʲ and Mg.ᵏ p dist. with extremely low energies predominant, from A.ˡ |

ᵃ L. Eyges, Phys. Rev. 86, 325 (1952).
ᵇ E. A. Whalin and A. O. Hanson, Phys. Rev. 89, 324 (1953).
ᶜ R. Sagane, Phys. Rev. 83, 174 (1951), and 84, 586 (1951).
ᵈ O. Hirzel and H. Waffler, Helv. Phys. Acta 20, 374 (1947).
ᵉ B. C. Diven and G. M. Almy, Phys. Rev. 80, 407 (1950).
ᶠ H. L. Poss, Phys. Rev. 79, 539 (1950).
ᵍ A. Lundby and L. Marshall, Phys. Rev. 83, 323 (1951).
ʰ Curtis, Hornbostel, Lee, and Salant, Phys. Rev. 77, 290 (1950).
ⁱ P. R. Byerly and W. E. Stephens, Phys. Rev. 83, 54 (1951).
ʲ W. A. Butler and G. M. Almy, Phys. Rev. 91, 58 (1953).
ᵏ M. E. Toms and W. E. Stephens, Phys. Rev. 82, 709 (1951).
ˡ D. H. Wilkinson and J. H. Carver, Phys. Rev. 83, 466 (1951).

protons may penetrate the potential barrier,
and the difficulty with experiment tends to be
resolved.

Observations of the angular distribution of the
emitted particles offered additional evidence to
question the compound nucleus theory. The
conclusion of many experiments was that although
low-energy particles were emitted isotropically,
the high-energy ones definitely favored
ejection at right angles to the incident beam.
This fact led Courant and others to propose,
following the deuteron photoeffect, that a nucleon
near the surface of a heavy nucleus may be
ejected by a direct interaction with the incident
photon (or possibly a neutron as well) without
the prior formation of a compound nucleus.²⁰
The copious emission of protons could also be
explained by this direct process. If this theory
was correct, one would expect the total cross
section curve to have a long, slowly falling,
high-energy tail rather than the usual resonant

20 E. D. Courant, Phys. Rev. 82, 703 (1951); C. Levinthal
and A. Silverman, Phys. Rev. 82, 822 (1951).


shape. There is some evidence for this tail in
the measurements made by Sagane on the
C¹²(γ,n) and Cu⁶³(γ,n) cross sections,²¹ but this
is in disagreement with results of several other
experimenters.

Studies of the energy distribution of the 
emitted particles have also been made in an 
effort to test these conflicting theories, for the 
direct process would tend to eject particles with 
very high energies in comparison with the 
evaporation process.

The compound nucleus model cannot be taken
for granted even for other types of nuclear reactions,
although there is much evidence to favor it;
for example, Ghoshal²² formed the excited nucleus
Zn⁶⁴ in two different ways (by bombarding Ni⁶⁰
with alphas and Cu⁶³ with protons) and discovered
that the relative numbers of n, 2n, and
pn emissions which took place were independent
of the mode of formation. On the other hand,
there is some evidence to doubt the validity of


21 R. Sagane, Phys. Rev. 83, 174 (1951); 84, 586 (1951). 
22 S. N. Ghoshal, Phys. Rev. 80, 939 (1950).

the compound nucleus model, and instead favor 
a shell model, in which the nucleons do not 
interact as strongly.²³


5. A Compromise Nuclear Model 


Present experimental data often are conflicting 
and inadequate to permit a very complete 
understanding of the mechanism of photonuclear 
reactions and the associated nuclear model. In 
Table I a summary of some of the experimental 
evidence for a particular reaction mechanism in 
medium and heavy nuclei is outlined. These 
experiments are concerned with photon energies 
on the order of tens of Mev. 

Courant pointed out²⁰ that the compound
nucleus model, favored by some experiments, 
could be partially retained although still assum- 
ing that the essential interaction is a direct one. 
Surface protons which interact, he maintains, 
could, more likely than not, remain in the nu- 
cleus, transfer their energy to the other con- 
stituents, and form a compound nucleus which 
would decay most probably by neutron emission. 
He proposed also an alternate process: as a 
compound nucleus is being formed (regardless of 
the mechanism of formation), before the excitation
spreads over the entire nucleus, a “local
heating” may take place which “boils out”
high-energy nucleons, permitting the protons to 
penetrate the Coulomb barrier. 

Bethe took into consideration the strong di- 
pole absorption near 20 Mev in favoring an 
alpha-particle model of the nucleus as a first 
approximation." For while there is correlation 
between neutrons and protons at lower energies 
in the vibration of the whole alpha particle 
within the nucleus, near 20 Mev the first excited 
state of the alpha particle occurs and strong 
dipole absorption would be present in its dis- 
ruption. Following Courant, the disruption may 
take the form of direct proton ejection from the 
alpha particle. If this occurs near the nuclear 
surface, the proton may be entirely removed 
from the nucleus; on the other hand, protons 
ejected from alphas in the nuclear interior may 
collide with other nucleons and alphas, forming 
an excited compound nucleus which can decay 
in the usual manner." 


23 Feshbach, Porter, and Weisskopf, Phys. Rev. 90, 166
(1953).

II. EXPERIMENTAL TECHNIQUES
A. Obtaining the Data 
1. Radiation Used 


The source of monoenergetic quanta in photo- 
nuclear experiments with all elements beyond 
beryllium has been the 17.6-Mev line of the
Li⁷(p,γ)Be⁸ reaction almost exclusively. This
reaction produces an additional line at 14.6
Mev of about half the intensity and twice the
width.²⁴ The effect of the lower line is negligible
for most light elements for it lies below the
threshold of nearly all of their photonuclear
reactions; but for medium and heavy elements,
it may lie at the maximum of the σ(γ,n) curve.
This fact may be used to advantage since the
σ(γ,2n) cross section lies near 17.6 Mev for
heavy elements, and a useful measurement of
$\sigma(\gamma,2n)_{17.6\ \mathrm{Mev}}$ relative to $\sigma(\gamma,n)_{14.6\ \mathrm{Mev}}$ may be
made.²⁵

A method for selecting monoenergetic quanta 
from bremsstrahlung has been developed at
Cornell.²⁶ It consists in establishing coincidence
between the recoil electron and an event induced
by the bremsstrahlung quantum. The energy of
the quantum is the difference between that of the
initial and recoil electrons. It has been used
apparently with success.²⁷ In most photonuclear
experiments, however, the entire bremsstrahlung 
is utilized and an energy dependence is estab- 
lished in the cross section measurements by 
successively lowering the maximum energy. It 
is this procedure which is discussed in the section 
below. 

2. Recording the Activation 


One method for determining the number of 
disintegrations which take place under a given 
flux of photons, is to observe, by means of a 
Geiger or scintillation counter, the decay of the 
residual nucleus, assuming that it is radioactive. 
The total number of disintegrations which take 
place (as calculated from the observed decay 
rate) is plotted against the maximum quantum 


24 R. L. Walker and B. D. McDaniel, Phys. Rev. 74, 315
(1948).

25 Carver, Edge, and Wilkinson, Phys. Rev. 89, 658
(1953).

26 J. W. Weil and B. D. McDaniel, Phys. Rev. 86, 582
(1952).

27 J. W. Weil and B. D. McDaniel, Phys. Rev. 90, 380
(1953).





energy of the bremsstrahlung. This reaction 
yield is proportional to the number of quanta of 
a given energy times the reaction cross section 
at that energy, integrated over all quantum 
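energies present in the beam. This may be
written

$$Y(\epsilon_M) = N n \int_{\epsilon_T}^{\epsilon_M} P_{\epsilon,\epsilon_M}\,\sigma_\epsilon\,d\epsilon, \qquad (4)$$

where $Y(\epsilon_M)$ = total reaction yield, $N$ = number
of nuclei per cm² in the sample exposed to the
beam, $n$ = total number of photons striking the
sample, $P_{\epsilon,\epsilon_M}$ = probability that a photon is
emitted with energy $\epsilon$, where the maximum
energy possible is $\epsilon_M$, $\sigma_\epsilon$ = reaction cross section
in cm² at $\epsilon$, and $\epsilon_T$ = threshold energy of the reaction.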

The theoretical number of photons in the 
bremsstrahlung is infinite since the number of 
low-energy photons increases without limit as 
their energy approaches zero. In practice, the 
rapid absorption of low-energy quanta permits 
Eq. (4) to be written as shown. The value of n, 
which is intended solely as a beam strength 
factor, is always given in terms of measurable 
quantities as shown below; furthermore, diver- 
gence of the integral is avoided by the finite 
lower limit which is set. 
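The yield integral of Eq. (4) can be evaluated numerically once a spectrum and a cross section are specified. The sketch below is only a toy model: the $1/\epsilon$ spectral shape, the resonance-shaped cross section, and all numerical parameters are assumptions chosen for illustration, not data from the article.

```python
import math

def toy_spectrum(eps, eps_max, eps_min=0.1):
    """Crude 1/eps photon spectrum, normalized between eps_min and eps_max (assumption)."""
    return 1.0 / (eps * math.log(eps_max / eps_min))

def toy_cross_section(eps, threshold=10.0, peak=17.0, width=5.0, sigma0=1.0e-25):
    """Resonance-shaped cross section in cm^2, zero below threshold (assumed parameters)."""
    if eps < threshold:
        return 0.0
    return sigma0 / (1.0 + ((eps - peak) / (width / 2.0)) ** 2)

def reaction_yield(eps_max, nuclei_per_cm2, photons, threshold=10.0, steps=2000):
    """Trapezoidal estimate of Y = N n * integral from threshold to eps_max of P * sigma."""
    if eps_max <= threshold:
        return 0.0
    h = (eps_max - threshold) / steps
    total = 0.0
    for i in range(steps + 1):
        eps = threshold + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * toy_spectrum(eps, eps_max) * toy_cross_section(eps, threshold)
    return nuclei_per_cm2 * photons * total * h

# Activation curve: yield as the maximum energy is raised in steps.
for eps_max in (12.0, 15.0, 18.0, 21.0):
    print(eps_max, reaction_yield(eps_max, 1.0e22, 1.0e10))
```

The finite lower limit at the threshold is what keeps the integral well behaved even though the toy spectrum grows without bound toward zero energy.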

To obtain an absolute value of the cross sec- 
tion, care must be taken to determine the total 
reaction yield by noting factors such as counting 
efficiency, activity half-life, proportion of dis- 
integrations unrecorded because of K-electron 
capture or absorption of the decay particle, etc. 

Major difficulties which are present are (a) 
the residual nucleus may be stable; (b) an 
activity may be encountered of excessively short 
or long lifetime; (c) the decay products of a 
particular reaction may conflict with that of 
other reactions from the same or different iso- 
topes in the absorber; (d) the decay products 
may be of low energy, or the decay scheme may 
be in doubt, so that an unknown fraction of 
emitted particles is counted. 

To avoid some of these problems, the yield of 
particles from the photoexcited nucleus often is 
measured, rather than from the residual nucleus. 
This has the further advantage that the energy 
and angular distributions of the emitted par- 
ticles may be measured. The technique suffers 
from two disadvantages, however, which seriously
limit its usefulness: (a) it is difficult to
separate the huge background of spurious radia- 
tions which appear during accelerator operation 
from that of the particles which emanate from 
the sample; (b) it is impossible to determine the 
number of particles emitted per nucleus, e.g., 
the existence of (γ,2n) or (γ,3n) cross sections
might be misinterpreted as a larger (γ,n) cross
section. Neutrons are detected by the various
methods in use, usually a paraffin-surrounded BF₃
counter for slow neutrons, or a foil of an element
with a high-energy neutron activation threshold
for fast neutrons.

Very often photonuclear reactions are studied 
by observing the individual event in a cloud 
chamber or nuclear emulsion. In this manner, 
useful data on the angular and energy distribu- 
tions of the emitted particles are obtained. A 
different method for measuring a charged-particle
energy distribution is due to Wilkinson
and Carver²⁸ who irradiated argon in a proportional
counter with 17.6-Mev gamma rays and
measured the proton energies by pulse-height 
analysis. 

Direct yield measurements may be combined 
with radioactive yield measurements to obtain 
data on the cross section for multiple emission. 
In this manner Whalin and Hanson²⁹ measured
the Ta(γ,2n) cross section by subtracting out the
(γ,n) activation curve from the activation curve
of total neutron yield while operating below the
theoretical threshold of the (γ,3n) reaction.
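The subtraction can be written out as simple bookkeeping. The sketch below assumes that the direct measurement counts every emitted neutron, so that each (γ,2n) event contributes two neutrons to the total; that factor of two, the function name, and the sample numbers are assumptions of this example rather than details given in the text.

```python
def two_neutron_yield(total_neutron_yield, single_neutron_activation_yield):
    """Estimate the number of (gamma,2n) events below the (gamma,3n) threshold.

    Assumes total neutrons = Y(gamma,n) + 2 * Y(gamma,2n).
    """
    excess = total_neutron_yield - single_neutron_activation_yield
    if excess < 0:
        raise ValueError("total neutron yield is smaller than the (gamma,n) yield")
    return excess / 2.0

# Made-up yields at one operating energy.
print(two_neutron_yield(1400.0, 1000.0))
```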


B. Interpreting the Data 
1. Normalizing the Incident Flux 


In order to obtain curves of absolute cross 
section, care must be taken that (a) a known 
fraction of the total reaction yield is measured, 
(b) the number of nuclei under irradiation is 
known, and (c) the number of photons bom- 
barding the sample in each quantum-energy 
range $\epsilon$ to $\epsilon + d\epsilon$, at each operating energy $\epsilon_M$,
is known. The difficulties involved in the abso- 
lute counting of beta particles from the residual 
radioactive nuclei, or particles from the photo- 
excited nuclei, are numerous. The problem of 


28 D. H. Wilkinson and J. H. Carver, Phys. Rev. 83,
466 (1951).

29 E. A. Whalin and A. O. Hanson, Phys. Rev. 89, 324
(1953).

measuring the number of nuclei per cm? exposed 
to the beam is relatively simple; however, if the
beam traverses only a small area of nonuniform 
foil, the chance of error increases. Of greatest 
difficulty is the determination of the photon 
energy distribution in the bremsstrahlung abso- 
lutely at the varied operating energies of the 
accelerator. 

One usually measures the total energy in the 
beam and then, from the theoretical-empirical 
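spectral distribution, calculates the photon density
per energy increment. The defining equation
for total energy is

$$U = n \int_0^{\epsilon_M} P_{\epsilon,\epsilon_M}\,\epsilon\,d\epsilon. \qquad (5)$$

Equation (4) may now be written

$$\frac{Y(\epsilon_M)}{U} = \frac{N \int_{\epsilon_T}^{\epsilon_M} P_{\epsilon,\epsilon_M}\,\sigma_\epsilon\,d\epsilon}{\int_0^{\epsilon_M} P_{\epsilon,\epsilon_M}\,\epsilon\,d\epsilon}. \qquad (6)$$
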

The spectral distribution $P_{\epsilon,\epsilon_M}$ has been calculated
for thin targets by Bethe and Heitler, integrated
over all directions;³⁰ Schiff obtained
the distribution in terms of the direction of
emission, notably the forward direction.³¹ This
has been verified experimentally at several
values of $\epsilon_M$.³² It must be noted that the calculations
were made using the Born approximation
which is invalid at quantum energies near 0
and $\epsilon_M$.

An exact measurement of U is difficult to 
make and therefore one is often content to 
measure relative cross sections, obtained by 
replacing U in Eq. (6) by a measurable pa- 
rameter which is a function of it. This is usually 
the roentgen, or ionization produced by the 
beam in 1 cm³ of air at standard conditions. In
the measurement, one tries to establish spectral
equilibrium, where the number of ions produced
in the chamber is proportional to the intensity
striking the chamber wall. This cannot be done 


30 H. A. Bethe and W. Heitler, Proc. Roy. Soc. (London)
A146, 83 (1934).

31 L. I. Schiff, Phys. Rev. 83, 252 (1951).

32 H. W. Koch and R. E. Carter, Phys. Rev. 77, 165
(1950); J. DeWire and L. Beach, Phys. Rev. 83, 233
(1951); R. H. Stokes, Phys. Rev. 84, 991 (1951).



effectively at all operating energies, since the 
bremsstrahlung produces secondary electrons of 
various ranges; the inevitable result is a number 
of incorrect roentgen readings. 

One can also monitor the beam accurately by 
determining the relative intensity, from run to 
run, in a finite energy interval. If the photon 
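sensitivity of a detector in an energy band
$\epsilon_a < \epsilon < \epsilon_b$ is $\sigma_s$, then the response, within the
band, of the detector is

$$R_{ab} = k_1 n \int_{\epsilon_a}^{\epsilon_b} P_{\epsilon,\epsilon_M}\,\sigma_s\,d\epsilon. \qquad (7)$$

If one stipulates further that the band lies well
below $\epsilon_M$ and is sufficiently narrow, then the
shape of $P_{\epsilon,\epsilon_M}$ within the band will be essentially
independent of $\epsilon_M$. Consequently, the integral
may be assumed constant for a range of beam
strengths and operating energies; the beam
strength $n$ may then be eliminated in Eq. (4)
to obtain

$$\frac{Y(\epsilon_M)}{R_{ab}} = k_2 \int_{\epsilon_T}^{\epsilon_M} P_{\epsilon,\epsilon_M}\,\sigma_\epsilon\,d\epsilon. \qquad (8)$$
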

The pair spectrometer, in magnetically sorting 
out the energies of pair-produced electrons from 
the bremsstrahlung, may be used in this manner. 
Furthermore, if many such energy intervals are 
resolved, a value for the spectral distribution is 
also found, independent of theoretical calcula- 
tions, using Eq. (7). 
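The practical effect of this band monitoring is that the ratio of reaction yield to monitor response no longer depends on the beam strength. A minimal sketch of that bookkeeping follows; the function name and the counts are made up for illustration.

```python
def normalized_yield(yield_counts, band_response):
    """Ratio Y / R_ab for one run; proportional to the integral of P * sigma alone."""
    if band_response <= 0:
        raise ValueError("monitor response must be positive")
    return yield_counts / band_response

# Two hypothetical runs at the same maximum energy; the second has twice the beam strength.
run_weak = normalized_yield(1000.0, 50.0)
run_strong = normalized_yield(2000.0, 100.0)
print(run_weak, run_strong)
```

Runs taken at the same maximum energy but different beam strengths should give the same ratio, so the quantity can be compared from run to run.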

The yield of a photonuclear process also can 
serve to normalize the beam. Two methods are 
considered here. In the first, due to Sagane,²¹
a reaction is chosen as monitor whose cross section
is concentrated at an energy well below
that of the reaction being studied, as shown in
Fig. 5. The resonance characteristic of photonuclear
reactions permits the use of Eq. (8)
where $\sigma_s$ of Eq. (7) is the cross section of the
monitor reaction. In the second method, devised
by Newkirk and Laslett,³³ a mechanical oscillator
slides alternately a monitor and test sample into
the bremsstrahlung. Simultaneously, the operating
energy is switched between a given value
$\epsilon_M'$ and the particular $\epsilon_M$ of the run with the
sample (see Fig. 5). The monitor sample is thus

33 L. L. Newkirk and L. J. Laslett, U. S. Atomic Energy
Commission document ISC-183 (1951).








always irradiated under the same conditions;
deviations which appear in the monitor counts
will be a consequence of either random or long-period
intensity ($n$) changes of the accelerator
which will affect the yield of the monitor and
test samples in the same proportion. The monitor
yield is written as

$$Y' = k_3 n \int_{\epsilon_T'}^{\epsilon_M'} P_{\epsilon,\epsilon_M'}\,\sigma_\epsilon'\,d\epsilon = k_4 n, \qquad (9)$$

where the primes indicate parameters for the
monitor. Substituting for $n$ in Eq. (4), one
obtains Eq. (8) where $Y'$ replaces $R_{ab}$.


2. Measuring the Incident Flux 


Several methods have been used to measure 
the total bremsstrahlung energy falling on the 
target nuclei in order to make absolute cross- 
section measurements. 

The Saskatchewan betatron group interpreted 
the ionization produced in a thimble-type cham- 
ber in terms of the actual energy expended in 
producing each ion pair within the chamber, 
based on the formula of Gray.³⁴ This calculation
is somewhat crude and is hindered by the in- 
ability to measure properly the total ionization 
produced by high-energy bremsstrahlung. 

An improved method was devised by Blocker,
Kenney, and Panofsky, who consider the theoretical
number of electrons produced by high-energy
bremsstrahlung in passing through a
thin foil.³⁵ The essentials of this “balanced
converter” method are as follows: the ionization
from two foils of different Z but the same number
of electrons per cm² inserted into the beam is
measured separately and compared. The effect
of Compton electrons will be the same for each,
while that due to pair electrons is different, for
there is a marked inequality in the number of
pair electrons created in the two foils. (The
relative number of photoelectrons may be completely
neglected at Mev energies.) The pair
production per atom can be written

$$p = Z(Z+1)\,p_\epsilon. \qquad (10)$$

34 Johns, Katz, Douglas, and Haslam, Phys. Rev. 80,
1062 (1950).

35 Blocker, Kenney, and Panofsky, Phys. Rev. 79, 419
(1950).









[Figure 5: horizontal axis, quantum energy, with marked energies $\epsilon_a$, $\epsilon_b$, $\epsilon_M'$, and the successive maximum energies $\epsilon_{M_1}$, $\epsilon_{M_2}$, ....]

Fig. 5. Two methods for normalizing reaction yield
data to determine the photonuclear cross section $\sigma_\epsilon$ in an
unknown sample irradiated by bremsstrahlung at several
maximum energies $\epsilon_{M_1}$, $\epsilon_{M_2}$, .... In the first, the resonance
property of the monitor reaction $\sigma_s$ is used to measure the
relative number of photons within a narrow energy band.
In the second, the yield under different operating conditions
is continually referred to a standard yield at a fixed
maximum energy $\epsilon_M'$.


This includes triplet production (pair production
in an electronic field). In Eq. (10), $p_\epsilon$ is a
function essentially of quantum energy (zero
for $\epsilon < 1.02$ Mev) as shown in Fig. 1, with a
slowly varying Z dependence which is more
pronounced at very high quantum energies.³⁶
The ionization expected per cm² of foil A will be

$$I_A = C + 2 n N_e I_1 (Z_A + 1) \int_0^{\epsilon_M} P_{\epsilon,\epsilon_M}\,p_\epsilon\,d\epsilon, \qquad (11)$$

where $C$ = ionization due to Compton electrons,
$N_e$ = number of electrons per cm² in the foil
(= $NZ$), and $I_1$ = ionization expected theoretically
from a single electron (ionization per unit
length near minimum ionization times the
chamber thickness $t$). The factor 2 arises from
the fact that two electrons are created in each
photon encounter with an atom. The ionization
difference between the two foils is


$$I_A - I_B = 2 n N_e I_1 (Z_A - Z_B) \int_0^{\epsilon_M} P_{\epsilon,\epsilon_M}\,p_\epsilon\,d\epsilon. \qquad (12)$$

From the measured ionization difference and the
calculated integral the beam factor $n$ is found
and may be inserted into Eq. (5) to yield the
total energy. If greater accuracy is desired, the
slight Z dependence of $p_\epsilon$ may be taken into
account and the integral in Eq. (11) may be





36 A. I. Berman, Phys. Rev. 90, 210 (1953).






































































computed for each foil before subtracting the
ionizations. A slightly modified form of Eq. (12)
will be obtained. Sometimes the beam intensity
is identified by an effective number of quanta. It
is defined by Blocker, Kenney, and Panofsky³⁵ as

$Q = U/\epsilon_M \left(= n\int_0^{\epsilon_M} P_{\epsilon,\epsilon_M}\,\epsilon\, d\epsilon \Big/ \int_0^{\epsilon_M} d\epsilon\right)$. (13)
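As a rough numerical sketch of how Eq. (12) yields the beam factor, the pair integral may be evaluated by quadrature and divided into the measured ionization difference. Only the algebra below follows Eq. (12); the flat spectrum, the linear stand-in for the pair term, and the foil and chamber values are all illustrative assumptions and are not data from this article.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoidal rule, written out so it does not depend on the NumPy version."""
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def beam_factor(delta_I, N_e, I_1, Z_A, Z_B, eps, P, p_e):
    """Solve Eq. (12) for n:
    delta_I = 2 n N_e I_1 (Z_A - Z_B) * integral(P * p_e d eps)."""
    integral = trapezoid(P * p_e, eps)
    return delta_I / (2.0 * N_e * I_1 * (Z_A - Z_B) * integral)

# Hypothetical inputs, chosen only to exercise the formula.
eps = np.linspace(0.0, 20.0, 2001)                      # quantum energy grid, Mev
P = np.ones_like(eps)                                   # assumed flat spectrum shape
p_e = np.where(eps > 1.02, 1e-27 * (eps - 1.02), 0.0)   # toy pair term, zero below 1.02 Mev
N_e, I_1, Z_A, Z_B = 1.0e23, 1.0, 82, 29                # assumed foil and chamber values

# Build a synthetic ionization difference from a known n, then recover n.
n_true = 5.0e4
delta_I = 2.0 * n_true * N_e * I_1 * (Z_A - Z_B) * trapezoid(P * p_e, eps)
n_recovered = beam_factor(delta_I, N_e, I_1, Z_A, Z_B, eps, P, p_e)
print(n_recovered)
```

In practice the spectrum and the pair term would be taken from tabulated data, and the accuracy of the recovered beam factor is limited by how well that integral is known.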


An alternate method for finding U was also 
given by these authors, and consists of measur- 
ing the total ionization in a material caused by 
the initial flux. This will include the many 
multiple processes which may take place until 
all the mean energy eventually is converted. 
The accuracy is limited by (a) the slow but un- 
accountable variation of electron specific ioniza- 
tion with electron energy, and (b) the fraction 
of electrons diverted radially and escaping cap- 
ture by the chamber. The ionization loss in the 
chamber of thickness $t$ is measured at various
depths x of the test material. The total energy is 


$U = \int_0^{\infty} (-dE/dx)\,(I_x/I_1)\, dx$, (14)


where $-dE/dx$ = stopping power, or specific
ionization, of electrons of energy $E$ in the material
under study, and $I_x$ = ionization measured
at depth $x$.


The intensity of bremsstrahlung may be 
measured by making use of known photonuclear 
cross-section data and recording the total yield. 
The intensity is then found by means of Eq. (4). 
If desired, the value of $n$ so obtained may be


[Figure 6 schematic, panels a and b. Labels: electrons from accelerator, converter, sample foil, deflecting field, electron collector.]

FIG. 6. Alternate arrangements for measuring the incident
electron beam in photonuclear experiments with
the linear accelerator. In (a) the unconverted electrons are 
deflected out of the x-ray beam and collected in a Faraday 
cup. In (b) the collector may be placed close to the con- 
verter in order to capture the scattered electrons. A second 
foil precedes the converter in order to measure, and permit 
a correction for, the effect of electron-induced reactions in 
the sample foil. 




inserted in Eq. (5) to obtain the total energy.
This method was used by Krohn and Shrader³⁷
who observed photoproton tracks in D₂O-loaded
emulsions and used theoretical and experimental
data on the photodisintegration of the
deuteron. Measurements of the angular distribution
and length of the tracks provided an
independent measurement of $P_{\epsilon,\epsilon_M}$.

The pair spectrometer may be used to measure 
the beam energy. If the thickness of the foil in 
which the pairs are produced is known, in addi- 
tion to the empirical-theoretical values of the 
pair production cross section, and the counting 
efficiency, then one may standardize the spec- 
trometer counting rate against beam intensity. 

A sixth method for measuring the energy in 
the bremsstrahlung employs the calorimeter.³⁸
While this is quite successful in low-energy 
work, at high energies the problem of absorbing 
the very penetrating quanta—even by a thick 
section of lead, which consequently yields only 
a slight temperature rise—makes accurate meas- 
urements exceedingly difficult. 

A technique for ascertaining the beam energy 
which promises to permit photonuclear cross 
section measurements of excellent precision is 
based on a measurement of the intensity of the 
electrons which produce the bremsstrahlung. In 
circular accelerators it is difficult to provide a 
well-focused electron beam uncontaminated by 
gamma rays or nuclear particles, for measure- 
ment. This method, therefore, is well-suited for 
use with linear accelerators. The electrons might 
be measured by inducing a high-frequency volt- 
age in an internal probe in the accelerator tube; 
however, this involves the solution of many 
technical difficulties. Alternatively, one might 
capture the transmitted electron beam in a 
Faraday cup device, called a collector, which 
accumulates a net measurable charge. 

Figure 6 illustrates two possible arrangements 
for collecting photonuclear data. In the top view, 
the electrons are diverted into the collector, 
away from the bremsstrahlung produced within 
the converter foil. Many electrons are lost by 
scattering and will be unrecorded. To alleviate 


37 V. E. Krohn and E. F. Shrader, Phys. Rev. 87, 685
(1952). 


38 J. S. Laughlin and J. W. Beattie, Rev. Sci. Instr. 22, 
572 (1951); P. D. Edwards and D. W. Kerst, Rev. Sci. 
Instr. 24, 490 (1953). 















this, the collector in the bottom view is placed 
close to the converter; however, the primary 
electrons entering the sample foil will be re- 
sponsible for some error by inducing reactions 
directly, through their electromagnetic field. 
Although the cross section for these processes is 
only about one percent of that for photon- 
induced reactions, the number of electrons enter- 
ing the sample foil is still many times that of 
high-energy bremsstrahlung photons and so the 
yield due to both processes is of the same order 
of magnitude. A second test foil, preceding the 
converter, provides a measure of the fraction of 
reactions induced by the electrons alone. In the 
calculation of this correction, one must know 
the ratio of the photon-induced to the electron-induced
reaction cross section, available from measurements
of Brown and Wilson³⁹ on foils of various
elements. Care must be taken to include the 
effect of partial conversion of electron energy 
into bremsstrahlung within the sample foils 
themselves. 

Experiments in which the electron beam is 
extracted have an added advantage in the 
ability to define the thickness of the converter, 
and hence the spectral distribution corresponding 
to this thickness. This simplifies the problem 
arising with those betatrons and synchrotrons 
which use cone-shaped targets where the actual 
thickness through which the electrons pass is 
uncertain. 

In the preceding technique, it is essential 
that the absolute bremsstrahlung cross section, 
as well as the spectral distribution, be known. 
The theoretical cross section is accurate for 
low- and medium-Z elements, but for high-Z 
elements, where the conversion efficiency is 
greatest, the deviation from theory is most 
pronounced. This deviation is inaccurately 
known; therefore, one might avoid a possibility 
for error by utilizing a medium-Z converter 
despite its lowered efficiency for producing 
bremsstrahlung. 


3. Computing Cross Sections 


The resolution of yield data Y(e) into a cross- 
section curve is subject to considerable error. If 
activations are made at m operating energies 


39 K. Brown and R. Wilson, Phys. Rev. 93, 443 (1954).




above threshold, equally spaced in energy, then 
Eq. (4) may be written as 


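$$\begin{aligned}
Y_1 &= a_1\sigma_1,\\
Y_2 &= a_2\sigma_1 + b_2\sigma_2,\\
&\;\;\vdots\\
Y_m &= a_m\sigma_1 + b_m\sigma_2 + \cdots.
\end{aligned} \qquad (15)$$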

The Y’s are normalized as in Sec. 1. The σ’s
represent averages in each quantum-energy interval
and the coefficients are calculated from
the $P_{\epsilon,\epsilon_M}$ data; thus, $c_i/c_j$ would be the ratio of
photons in the third quantum-energy interval
above threshold at operating energy $\epsilon_{M_i}$ relative
to $\epsilon_{M_j}$. Cross sections at the higher energies
will evidently depend on those obtained at lower
energies, and the errors associated with the
low-energy cross sections will be carried along
with them. When subtracting quantities which
are of almost the same magnitude, a large net
fractional error is inevitable though separately
the errors are small. Consequently, cross sections
at the high end of the curve will experience
enormous errors. An additional complication
would result if the calculated value of one of the
partial cross sections, say $\sigma_j$, were too large.
The next, $\sigma_{j+1}$, would be correspondingly too
small and $\sigma_{j+2}$ too large again. Hence, $\sigma$ would
be plotted with an unnatural oscillation resulting
solely from the error in a single piece of data.
This fact is actually taken advantage of in a
method for analyzing the data which restricts
the errors in the partial cross sections to such
values as will avoid this oscillation. In this
“photon difference method” of Katz and
Cameron⁴⁰ the yield curve is smoothed by the
requirement that it have smooth first and second
derivatives.
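The propagation of a single bad yield point through the triangular system of Eq. (15) can be seen in a small numerical sketch. The coefficient matrix below is a toy model (photon number in a bin assumed to rise linearly with the bin's distance below the spectrum tip) and the cross sections are invented; neither is taken from the article. Only the forward-substitution step mirrors Eq. (15).

```python
import numpy as np

def unfold(Y, A):
    """Forward substitution for the lower-triangular system Y = A @ sigma of Eq. (15)."""
    m = len(Y)
    sigma = np.zeros(m)
    for i in range(m):
        # Subtract the contribution of the already-determined lower-energy bins.
        sigma[i] = (Y[i] - A[i, :i] @ sigma[:i]) / A[i, i]
    return sigma

m = 8
# Hypothetical coefficients: row i = operating energy i, column k = quantum-energy bin k.
A = np.array([[(i - k + 1) / (i + 1) if k <= i else 0.0 for k in range(m)]
              for i in range(m)])
sigma_true = np.array([1.0, 2.0, 4.0, 6.0, 4.0, 2.0, 1.0, 0.5])  # invented resonance shape
Y = A @ sigma_true

Y_bad = Y.copy()
Y_bad[3] *= 1.02  # a 2 percent error in one yield point only

err = unfold(Y_bad, A) - sigma_true
print(np.round(err, 3))
```

With these assumed coefficients the error appears in the fourth bin, changes sign and doubles in the fifth, and changes sign again in the sixth, which is the alternation described above. A different spectrum shape would change the magnitudes but not the mechanism.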

One method for determining photonuclear 
cross sections consists in subtracting the absorption
cross sections due to electronic interactions
from the total cross section. The latter may
be measured with photonuclear detectors;⁴¹ in
a more recent method, the bremsstrahlung 
quanta are analyzed by total absorption in a 
sodium iodide crystal after transmission through 


40 L. Katz and A. G. W. Cameron, Can. J. Phys. 29,
518 (1951). 

41 Haslam, Horsley, Johns, and Robinson (to be pub- 
lished). 







a known sample.⁴² But, as is clear from Fig. 1,
the result is extremely sensitive to inaccuracies 
in the electronic absorption data. 


C. Measurement of Reaction Energies 
1. Reaction Threshold Energy 


The determination of the exact energy at 
which a photonuclear reaction begins has been 
the subject of a number of careful experiments 
largely with 20-Mev betatrons.⁴³ These machines
nearly always are calibrated at the threshold 
energies of photoinduced reactions whose par- 
ticle binding energies are well known. Therefore, 
threshold energy measurements on unknown 
nuclei will be relative to these standards. 

In the measuring procedure, the yield at several
$\epsilon_M$ slightly above threshold is normalized to a
constant total ionization, since the operating
range is so restricted. The yield near threshold
in most cases is observed to increase with the
square of the energy above threshold, so that the
intercept on the energy axis of the straight-line plot
of $[Y(\epsilon_M)]^{1/2}$ vs $\epsilon_M$ (after subtracting the
background activity produced when operating below
threshold) is at the threshold energy.
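The extrapolation just described amounts to a straight-line fit of the square root of the background-corrected yield against operating energy. A minimal sketch follows; the yields are synthetic, generated from an assumed threshold and an assumed quadratic law, so the numbers illustrate the procedure only.

```python
import numpy as np

def threshold_energy(eps_M, Y, background=0.0):
    """Fit sqrt(Y - background) against eps_M and return the energy-axis intercept."""
    root = np.sqrt(np.clip(np.asarray(Y, dtype=float) - background, 0.0, None))
    slope, intercept = np.polyfit(np.asarray(eps_M, dtype=float), root, 1)
    return -intercept / slope

# Synthetic yields obeying Y = k (eps_M - eps_t)^2 + background (all values assumed).
eps_t, k, bg = 10.6, 3.0, 0.5
eps_M = np.array([10.8, 11.0, 11.2, 11.4, 11.6])   # operating energies, Mev
Y = k * (eps_M - eps_t) ** 2 + bg

est = threshold_energy(eps_M, Y, background=bg)
print(round(est, 3))
```

With real data the fitted intercept carries the uncertainty of the energy calibration, which is why the result is relative to the calibration standards mentioned above.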


2. Mean Reaction Energy 


The resonance character of photonuclear cross 
section curves makes it convenient to determine 
the effective quantum energy for the reaction 
by more direct means than actually plotting the 
curve of cross section vs energy. This effective 
energy is defined by 


$\bar{\epsilon} = \int \sigma\,\epsilon\, d\epsilon \Big/ \int \sigma\, d\epsilon$. (16)

42 H. W. Koch and R. S. Foote, Phys. Rev. 91, 455
(1953).

43 McElhinney, Hanson, Becker, Duffield, and Diven,
Phys. Rev. 75, 542 (1949); Sher, Halpern, and Mann,
Phys. Rev. 84, 387 (1951).

Leona Marshall found the effective energies
for the C¹²(γ,n) and Cu⁶³(γ,n) reactions by
measuring the total absorption, in various elements,
of the radiation which excited the reactions.⁴⁴
The effective energy was determined
from the known energy dependence of the absorption
coefficients. The radiation first entered
a foil of carbon or copper whose activity was to
be counted. It then passed through the absorber
followed by a second foil similar to the first.
The relative number of counts induced in the
two foils with and then without the absorber in
place gave a measure of the absorbed radiation.
The absorption cross section for a given element 
may be written 


$\sigma_e = (Z+1)\,p_e + c_e = aZ + b$, (17)


where $\sigma_e$ = absorption cross section per electron;
$c_e$ = Compton cross section per electron, and
$a$, $b$ = functions only of quantum energy. The
experimental cross section $\sigma_e$ is plotted against
Z and values are found for both $a$ and $b$. These
are equated, respectively, to the theoretical pair
production cross section (from which the triplet
cross section has been removed) and the Compton-triplet
cross section, and two independent
values of the quantum energy for absorption are
determined.
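Since Eq. (17) is linear in Z, the slope and intercept follow from a least-squares straight line through the measured per-electron cross sections. The sketch below uses invented values for the pair and Compton terms (arbitrary units) merely to show that the fitted slope returns the pair term and the fitted intercept returns the sum of the pair and Compton terms; it does not reproduce any measured data.

```python
import numpy as np

def fit_a_b(Z, sigma_e):
    """Least-squares straight line sigma_e = a Z + b, as in Eq. (17)."""
    a, b = np.polyfit(np.asarray(Z, dtype=float), np.asarray(sigma_e, dtype=float), 1)
    return a, b

# Hypothetical per-electron values standing in for p_e and c_e at one quantum energy.
p_e, c_e = 0.004, 0.30
Z = np.array([6, 13, 29, 50, 82])        # absorber elements (assumed choice)
sigma_e = (Z + 1) * p_e + c_e            # Eq. (17) evaluated for each absorber

a, b = fit_a_b(Z, sigma_e)
print(a, b)
```

Each of the two fitted quantities would then be compared with its theoretical energy dependence to obtain an estimate of the quantum energy.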

In Strauch’s method the relative activation
of the rear foil at various absorber depths is
measured.⁴⁵ The experimental procedure is analogous
to the plotting of relative ionization curves
discussed in Sec. IIB2, except that now the beam
energy which induces a photonuclear reaction,
rather than that which forms ion pairs, is plotted.
If $A_x/A_0$ is the ratio of the activity at depth $x$ to
that at zero depth, then the area under the
curve, $\int_0^{\infty} (A_x/A_0)\,dx$, is called the track length.
Clearly, the interaction of bremsstrahlung quanta
with the absorber material produces a spectral
distribution which is a changing function of
depth. An identification of a single energy in
this distribution, taken over the entire absorber
depth, is made through the sharp-line resonance
assumed for the induced reaction. The resulting
track length is a unique function of this energy,
as shown by Eyges,⁴⁶ and the effective energy is
thus obtained.
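The track length is simply the area under the measured relative-activity curve, so it can be estimated from activations at discrete depths by the trapezoidal rule. In the sketch below the activity is assumed to fall off exponentially with depth, which is a convenience for checking the arithmetic and not a claim about real transition curves; the depth units and attenuation constant are likewise invented.

```python
import numpy as np

def track_length(x, A_x):
    """Area under the relative-activity curve A_x/A_0 (trapezoidal rule)."""
    x = np.asarray(x, dtype=float)
    rel = np.asarray(A_x, dtype=float) / float(A_x[0])   # normalize to zero depth
    return float(np.sum(0.5 * (rel[1:] + rel[:-1]) * np.diff(x)))

# Assumed toy data: activity decreasing exponentially with absorber depth.
mu = 0.05                                  # hypothetical attenuation constant
x = np.linspace(0.0, 400.0, 4001)          # absorber depths, arbitrary units
A_x = 1200.0 * np.exp(-mu * x)             # counts at each depth

L = track_length(x, A_x)
print(L)
```

For this assumed curve the exact area is very nearly 1/mu, which the numerical result approaches as the depth spacing is reduced.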

44 L. Marshall, Phys. Rev. 83, 345 (1951).

45 K. Strauch, Phys. Rev. 81, 973 (1951).

46 L. Eyges, Phys. Rev. 81, 981 (1951).





















Systems of Units in Mechanics—A Summary 


M. LOREN BULLOCK


Hamilton College, Clinton, New York 


(Received October 28, 1953) 


The theoretical basis and the historical background of five systems of units are described: the
British gravitational system, the British absolute system, the metric absolute (cgs) system, the
metric gravitational system, and the mks system. No one system is considered superior to any
other, for each one has its advantages and its usefulness. It is shown that a law of physics can
have exactly the same form in each of the five systems, but common practices are described in
which this principle is not maintained. It is suggested that a student needs to see the complete
picture of the systems, and that this should be presented as early in the physics curriculum as
possible.





INTRODUCTION 


In the past 150 years much has been written
concerning the particular units to be asso- 
ciated with physical concepts. For many years 
discussion concerned whether to measure length 
in meters or yards and to measure mass in pounds 
or grams. More recently the discussion has been 
whether mass should be in pounds or slugs, grams 
or kilograms. It should be worthwhile, therefore, 
to review the present state of our systems of 
units in mechanics and to see their place in the 
teaching of introductory and intermediate
courses in physics. 

The systems of units in use today are the result
of a long historical development¹ which of
course has not ended. At the present time this 
development has led to at least five systems of 
units which are in common use, and a student 
usually meets units from all five before pro- 
gressing very far in a science course. The develop- 
ment of these systems went hand in hand with 
the development of the physical concepts. Thus 
the earliest fundamental units were those of 
weight, length, and time, which form the present 
British gravitational or British engineering sys- 
tem of units. After the concept of mass had been 
developed during Newton’s time, Carl Friedrich 
Gauss suggested a so-called absolute system 
which was based on the units of mass, length, and 
time. Gauss, however, based his system not only 
on the unit of mass instead of weight, but also 
upon the new metric standards of length and 

1 W. Hallock and H. T. Wade, The Evolution of Weights
and Measures and the Metric System (Macmillan Company,
New York, 1906); R. Glazebrook, Nature, No. 3218
(July 4, 1931); Natl. Bur. Standards U.S. Misc.
Publ. No. M64 (1925).




mass, and his system became the basis for the 
present cgs system. 

The multiplicity of the systems is thus based 
upon two facts: (1) there are two standards for 
length and mass: metric and British, and (2) 
there is a choice for the fundamental units be- 
tween those of mass, length, and time (absolute 
systems) and force, length, and time (gravita- 
tional systems). This gives four possible combin- 
ations which lead to the metric absolute (cgs), 
metric gravitational, British absolute, and Bri- 
tish gravitational systems of units. In addition 
there has recently come into use a fifth system, 
the mks system, but it too is a metric absolute 
system and is a simple multiple of the cgs system. 
These five systems with their units for many of 
the physical concepts of mechanics are summarized
in Table I.² The defining equation for each
concept is also shown. 


METRIC VERSUS BRITISH 


One difference between metric and British 
systems is simply that there are two standards of 
length: the meter and the yard.

The standard of length for metric systems is 
the International Prototype Meter, a platinum- 
iridium bar which is kept in a vault of the 
International Bureau of Weights and Measures 
near Paris. Copies of this bar were constructed 
and compared with the International Prototype 
and were then delivered to the various govern- 
ments in the international organization which 
had formed the International Bureau in 1875. 

2 The abbreviations in the table and in the remainder of
the article are essentially those suggested by D. Roller,
Am. J. Phys. 21, 293 (1953).




TABLE I. The systems of units for mechanics. The defining equation for each concept is shown, vectors being indicated
with boldface type. The units of the three fundamental quantities in each system are shown in capitals, indicating that
these concepts are defined operationally. All other units in a particular system are derived from the fundamental units of
that system.

                            Gravitational                       Absolute
Concept                     British                Metric       British               Metric (cgs)       Metric (mks)
--------------------------  ---------------------  -----------  --------------------  -----------------  -------------------
TIME, t                     SEC                    SEC          SEC                   SEC                SEC
LENGTH, s                   FT                     CM           FT                    CM                 METER
Velocity, v=ds/dt           ft/sec                 cm/sec       ft/sec                cm/sec             m/sec
Acceleration, a=d²s/dt²     ft/sec²                cm/sec²      ft/sec²               cm/sec²            m/sec²
Force                       POUND (F)              GRAM (F)     poundal (F=ma)        dyne (F=ma)        newton (F=ma)
Mass                        slug (m=F/a)           ( ) (m=F/a)  POUND (m)             GRAM (m)           KILOGRAM (m)
Work, Energy, W=∫F·ds       ft-lb                  …            ft-pdl                erg (=d-cm)        joule (=new-m)
Impulse, Momentum, J=∫F dt  lb-sec or slug-ft/sec  …            pdl-sec or lb-ft/sec  d-sec or g-cm/sec  new-sec or kg-m/sec
Power, P=dW/dt              ft-lb/sec              …            ft-pdl/sec            erg/sec            watt (=joule/sec)
Torque                      lb-ft                  …            pdl-ft                cm-d               m-new
Moment of Inertia, I=Σms²   slug-ft²               …            lb-ft²                g-cm²              kg-m²
Density, d=m/V              slug/ft³               …            lb/ft³                g/cm³              kg/m³
Pressure, p=F/A             lb/ft²                 …            pdl/ft²               d/cm²              new/m²

The National Bureau of Standards in Washington,
D. C., has two of these. The one known as
the National Prototype Meter No. 27 is used as
the primary standard of length in the United
States.

The yard is a much older standard, going back
at least to Saxon times in England. Some of the
early standard bars are still in existence, the
earliest being the one of Henry VII dating from
1496. It was replaced by Elizabeth I in 1588 with
a new one which was used until 1824 even though
some years earlier it had evidently been broken
and then somewhat crudely mended with a
dovetail joint! The present standard, the
Imperial Standard Yard, was constructed in
1845. It is a solid square bar of a special bronze,
and it is kept at the Standards Office of the
Board of Trade in London. Several copies were
made, and one was presented to the United States
in 1858. When the International Prototype
Meter was constructed, however, it was considered
to be far superior as a primary standard,


and since that time the yard in the United States 
has been defined with reference to the Inter- 
national Prototype Meter. 

Throughout the latter part of the 19th Century 
it was felt by many people that the metric system 
should replace the British. The first step in the 
United States toward the adoption of the metric 
system came in 1866 when Congress made the 
use of metric units legal on an optional basis. The 
proponents of the act fully expected that 
appropriate legislation would eventually be 
passed making the use of the metric system 
obligatory, as was being done in most of continental
Europe.³ Such legislation has never been
passed in the United States, however. The 
British system proved to be too firmly rooted in 
everyday living, and all attempts to eliminate the 
customary units were resisted vigorously. Many 

3 The use of metric units was made obligatory in the
Low Countries in 1820, France in 1837, Spain in 1860,
Italy in 1861, Germany and Portugal in 1872, Austria in
1876, Norway in 1882, Sweden in 1889, and Denmark in
1912.















pamphlets and books were written on both sides, 
and some were quite vehement. Little is heard of 
the argument any more, however, and the 
metric system in the United States today re- 
mains legal, but optional. It nevertheless has 
gained wide usage in scientific and engineering 
fields. 

In most European countries, the metric system 
is used even in manufacturing and commerce, 
and studies⁴ indicate that the use of the metric
system is quite complete, with little use of the 
former measures. 


ABSOLUTE VERSUS GRAVITATIONAL 


While the difference between metric and 
British systems is due to the existence of different 
standards, the essential distinction between 
absolute and gravitational systems is in the 
definitions of the concepts of mass and force. 

For absolute systems mass can be defined as 
the ratio of the accelerations of two mutually 
interacting bodies.⁵ This is what Bridgman calls
an operational definition or what Margenau calls
an epistemic definition.⁶ With mass defined
operationally, force is then defined as the product 
of mass and acceleration, in what Margenau 
calls a constitutive definition. Since the concept 
of mass is used in the definition of force, mass is 
considered to be a fundamental quantity, and 
force is a derived quantity. To define the mass of 
an object operationally, a comparison must be 
made with some other object which can then be 
taken as standard. The mass of the standard 
object is then defined to be unity, and here again 
the metric and British systems differ, for each 
has a different standard object. The metric 
standard of mass is a cylinder of platinum-iridium
alloy known as the International
Prototype Kilogram, which was constructed in 
1879 and is kept at the International Bureau of 
Weights and Measures. The British standard of 
mass is a brass cylinder constructed in 1844 and 
known as the British Imperial Pound. 


4 A. E. Kennelly, Vestiges of Premetric Weights and
Measures (Macmillan Company, New York, 1928).

5 See, for instance, Lindsay and Margenau, The Foundations
of Mechanics (John Wiley and Sons, Inc., New York,
1936), pp. 91-94.

6 H. Margenau, The Nature of Physical Reality (McGraw-Hill
Book Company, Inc., New York, 1950), Chap. 12,
“The Role of Definitions in Science.”




For gravitational systems, however, force is 
the fundamental quantity defined operationally, 
while mass is the derived quantity defined as the 
ratio of force and acceleration. One operational 
definition of force involves the extension of a 
spring which is calibrated by observing the 
extension when some standard object is hung 
from it. The gravitational unit of force is there- 
fore related to the weight of the standard object. 
By the weight of an object is generally meant the 
force of gravitational attraction on the object by 
the earth. There is actually a slight difference 
between the apparent and the true force of 
gravitational attraction because of the presence 
of the centripetal force due to the rotation of the 
earth. The apparent force equals the centripetal 
force subtracted vectorially from the true force. 
This in part accounts for a variation of weight 
with latitude. But a unit of force is desired which 
does not vary with location. So the International 
Committee of Weights and Measures has de- 
fined a standard location to which all weighings 
are to be referred. The weight of an object at this 
standard location is called its standard weight. At 
any other location, the stretch of a spring is 
recalibrated so that it will still read in terms of 
standard weight. The standard weight of an 
object, therefore, does not vary with location. 
On the other hand, the local weight, i.e., the 
apparent force of gravity, does vary with location, 
and only at the standard location does the local 
weight equal the standard weight. The unit of 
force, therefore, can now be defined uniquely as 
the standard weight of the standard object. In what 
follows, the term weight by itself will be assumed 
to mean standard weight. Local weight is a 
vector. Standard weight, however, is often considered
to be a scalar, merely a number to be
associated with the object. Although, as was 
mentioned, a spring must be recalibrated with the 
standard object for measurements at different 
locations, a beam balance will measure standard 
weight⁷ at any location, because whenever a
measurement is made, the standard object or 
accurate copies are used. 

The distinction between standard weight and 

7 A beam balance is often said to measure mass. It does
so only indirectly, however. For a discussion of this in an
introductory text, see L. W. Taylor, Physics, the Pioneer
Science (Houghton-Mifflin Company, Boston, 1941), p. 117.
Also see reference 5.




local weight has been emphasized because it 
brings out two aspects of g, the acceleration due 
to gravity. One point of view is that g is the ratio 
of local weight and mass. As such it is a vector 
quantity (directed down), and it varies with 
location. This can correctly be called the 
acceleration due to gravity. The other point of 
view is that g is the ratio of standard weight and 
mass. This is a scalar and a constant, symbolized 
by $g_s$. It is often called standard gravity, although
Sleator⁸ has aptly suggested that it be called
simply the weight per unit mass in order to
emphasize that it is used even when there is no
freely falling body. For $g_s$ is simply a constant
used to calculate mass when standard weight is
known or to calculate the standard weight when
the mass is known. The International Committee
of Weights and Measures has assigned to $g_s$ the
value 32.1740 ft/sec² or 980.665 cm/sec² in
accordance with the Committee’s definition of 
standard weight. Thus standard location is by 
definition any place where the acceleration due to 
gravity has these values, which are actually very 
close to those at sea level at latitude 45 degrees. 
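The two quoted values of standard gravity are the same quantity in different length units, and used as a plain constant they convert a mass in absolute units to a standard weight in the derived absolute force unit. The short check below assumes the conversion 1 ft = 30.48 cm, which is not stated in this article; the function name is illustrative.

```python
FT_IN_CM = 30.48      # assumed conversion factor, cm per ft (not given in the text)
G_S_CGS = 980.665     # cm/sec^2, value quoted in the text
G_S_FPS = 32.1740     # ft/sec^2, value quoted in the text

def standard_weight(mass, g_s):
    """Standard weight in absolute force units: W = m * g_s (a scalar relation)."""
    return mass * g_s

# The cgs value re-expressed in ft/sec^2 should agree with the quoted British value.
g_fps_from_cgs = G_S_CGS / FT_IN_CM
print(round(g_fps_from_cgs, 4))

# Standard weight of the unit mass in each absolute system.
print(standard_weight(1.0, G_S_FPS))   # poundals on a one-pound mass
print(standard_weight(1.0, G_S_CGS))   # dynes on a one-gram mass
```

The agreement to four decimal places is only as good as the assumed foot-to-centimeter factor.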

The unit of force for gravitational systems has 
thus been defined in terms of the standard 
weight of some standard object. No description 
has yet been given of the standard objects 
actually in use. The standard objects for mass 
in absolute systems have already been mentioned : 
the International Prototype Kilogram and the 
British Imperial Pound. For gravitational systems, 
the same standard objects are used. Thus the 
British Imperial Pound is the standard object for 
mass in the British absolute system, and it is also 
the standard object for weight (and hence force) 
in the British gravitational system. Moreover, 
the units are given the same name, the pound. 
This is the origin of the statement that a one 
pound mass weighs one pound. More accurately, a
one pound mass in British absolute units weighs 
one pound in British gravitational units. That this 
is so is, of course, due merely to the choice of the 
same standard object for both systems. The same 
procedure holds true for the International 
Prototype Kilogram for metric systems. Hence 
a gram is the unit of mass in metric absolute 
units and the unit of weight in metric gravita- 
tional units. 


8 W. W. Sleator, Am. J. Phys. 15, 251 (1947). 




One further matter of terminology must be 
mentioned. Before the definition of standard 
location, the unit of weight was unsatisfactory to 
use as a standard because it varied with location. 
This was the main reason that Gauss suggested 
the use of absolute systems based on mass, length, 
and time. It was felt, therefore, that absolute
systems were more fundamental or “absolute”
than gravitational ones. As developed here, however,
both are equally “absolute,” and they are
entirely distinct from one another. Moreover, in
principle, any three mechanical units can be used
as fundamental ones, and all others can be 
formed from them. So the choice between force, 
length, and time, or mass, length, and time, or 
any other three mechanical units is simply a 
matter of convenience, coupled with custom and 
habit. The terms absolute and gravitational must, 
therefore, be considered only as traditional names 
for the systems and nothing more. 


THE BRITISH GRAVITATIONAL SYSTEM 


In this system the fundamental unit of length 
is the foot, defined as one-third of the yard. The 
unit of force is the pound. More specifically, it is 


the standard weight of the British Imperial 
Pound. The weight of an object in this system is 
therefore measured in pounds. 

The unit of mass in this system is, from the 
definition of mass for gravitational systems, that 
mass of an object which has an acceleration of one 
ft/sec² when a force of one pound is exerted on
the object. To measure mass, therefore, a force 
can be applied to an object and the corresponding 
acceleration measured. A much simpler procedure 
is used in practice, however. The weight is first 
measured with a beam balance or a calibrated 
spring balance, and then the mass is calculated 
by dividing the weight (in pounds) by standard 
gravity, $g_s$. For many years the unit of mass went
without a name, although recently the term slug 
has been increasingly used. Whether the unit is 
named or not, however, the concept of mass must 
be recognized as such and not confused with 
weight. 
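The practical procedure just described, weighing first and then dividing by standard gravity, can be written out in a few lines. The sketch uses the value of standard gravity quoted earlier in the article; the function names and the sample weights are illustrative only.

```python
G_S = 32.1740  # standard gravity in ft/sec^2, as quoted in the text

def slugs_from_pounds(weight_lb):
    """Mass in slugs from standard weight in pounds: m = W / g_s."""
    return weight_lb / G_S

def acceleration_ft_per_sec2(force_lb, weight_lb):
    """Acceleration from a = F/m, with F in pounds and m in slugs."""
    return force_lb / slugs_from_pounds(weight_lb)

# An object of standard weight 32.1740 lb has a mass of one slug,
# so a one-pound force gives it an acceleration of one ft/sec^2.
print(slugs_from_pounds(32.1740))
print(acceleration_ft_per_sec2(1.0, 32.1740))

# An arbitrary example weight.
print(slugs_from_pounds(161.0))
```

Keeping the division by standard gravity explicit is one way of keeping mass and weight from being confused in a calculation.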

The other physical concepts in this system 
have simply derived units. The definition of work 
as the space integral of the force means that its 
units are those of force multiplied by length. 
This is customarily called the foot-pound. Since 







this space integral also equals the change in 
kinetic energy, the foot-pound is also used as the 
unit of energy. The definition of impulse is the 
time integral of the force, so its unit is the 
pound-second, and since this integral is equal to 
the change in momentum, the units of momentum 
are either pound-second or slug foot per second. 
The unit of power will be a foot-pound per second 
although in practice a larger multiple is used, a 
horsepower, which equals 550 ft-lb/sec. These 
are all derived units following from the definitions 
of the concepts, as shown in Table I. 


THE BRITISH ABSOLUTE SYSTEM 


In this system mass is defined operationally, 
and the unit of mass is based on the mass of the 
British Imperial Pound. In this system, there- 
fore, the pound is the unit of mass. The force 
unit, on the other hand, is now a derived one. It 
is that force which is acting when an object of 
one-pound mass has an acceleration of one ft/sec².
The name given to this unit of force is the 
poundal. Weight would therefore be measured 
in poundals in this system. In actual use, how- 
ever, the unit of force in this system often goes 
nameless. 

To measure the mass of an object, the opera- 
tional definition suggests that the object must be 
made to interact with the standard object and 
the resulting accelerations measured. Here again, 
however, a much simpler procedure is used in 
practice. The object is merely weighed on scales 
calibrated for the British gravitational system. 
In other words, the object is weighed in pounds 
and since the same standard object is used for 
weight in British gravitational units and for mass 
in British absolute units, this weight in pounds 
corresponds to the mass in pounds in the British 
absolute system. It should be realized, however, 
that weighing the object in pounds and then 
writing down the result as a mass of so many 
pounds involves a conversion—often uncon- 
scious—from gravitational to absolute units. 

Other units in the absolute system follow from 
the definitions of the concepts, as shown in Table 
I. It is unfortunate that both the British absolute 
and the British gravitational systems are often 
mixed together in practice, sometimes giving the 
impression that there is only one British system. 
Force is commonly measured in pounds, energy 




in foot-pounds, and power in horsepower, all of 
which are British gravitational units. On the 
other hand, many tables in handbooks use the 
British absolute system for densities (lb/ft³), 
moments of inertia (lb-ft²), and any quantity 
involving mass measured in pounds. As devel- 
oped here, however, the pound as a unit of mass 
and the pound as a unit of force belong to two 
distinct systems of units. That this is not always 
made clear is in part due to an imperfect under- 
standing of the concept of mass, which often is 
identified with weight. 


THE METRIC ABSOLUTE SYSTEM (CGS) 


In this system the unit of length is the centi- 
meter, which is 1/100 of the length of the Inter- 
national Prototype Meter, and the unit of mass 
is the gram, which is 1/1000 of the mass of the 
International Prototype Kilogram. The defini- 
tions of the units parallel those of the British 
absolute system. The only difference is that 
different standards are used for length and mass. 
The derived unit of force, the dyne, is therefore 
that force acting when an object of one gram 
mass has an acceleration of one cm/sec². Notice 
the correspondence of the dyne in this system 
with the poundal in the British absolute system. 

The other derived units follow naturally. The 
unit of work and energy, the dyne-centimeter, is 
given a special name, the erg. The unit of pressure 
is dyne per square centimeter, but multiples of 
this are often used in practice, so that 10³ dynes/cm² 
is called one millibar, and 1.013×10⁶ dynes/cm² is 
called one atmosphere. 

The units of this system are the familiar cgs 
units which have been in use for so many years. 
The international electrical units were based on 
this system, although the mks system is now 
often used instead. It is nevertheless still ex- 
tensively used in many fields and will probably 
continue to be used for many years to come. 


THE METRIC GRAVITATIONAL SYSTEM 


For this system the unit of force is based upon 
the standard weight of the International Proto- 
type Kilogram, and the fundamental unit of force 
used is the gram. The unit of mass would then be 
that mass of an object having an acceleration of 
one cm/sec² when acted upon by a force of one 





296 M. 


gram. No special name is given to this unit of 
mass, although the term metric slug is sometimes 
suggested. The use of this system seldom extends 
to other units than these. For this reason there 
are many blank spaces in Table I for this system. 
Weighing in grams, however, is extremely 
familiar even though few calculations are carried 
out in this system, conversion usually being made 
to the cgs system. This is simply done, of course, 
for the weight in grams in metric gravitational 
units is the mass in grams in cgs units. 

The statement is sometimes heard that atomic 
weights are “really” atomic masses. The reason 
is that atomic weight measured in grams is mass 
in cgs units, but as used here, atomic weight 
measured in grams is also weight in metric 
gravitational units. It is thus quite correct to 
speak of atomic weights as such. 


THE METRIC ABSOLUTE SYSTEM (MKS) 


In recent years another metric absolute system 
has come into use which, instead of using centi- 
meters, grams, and seconds as fundamental units, 
uses meters, kilograms, and seconds. The ad- 
vantage is that its use leads directly to the 


practical electrical units which are in common 
use. Since it is an absolute system, the kilogram 
is the unit of mass, and the derived unit of force 
(kg-m/sec²) is called the newton. Several of the 
other derived units correspond to familiar 
practical units, so that the unit of work, a 
newton-meter, is a joule, and the unit of power, a 
joule per second, is a watt. 

There are those who consider that the mks 
system will eventually supersede the cgs system, 
and it would be convenient to have one system 
for everything. But the cgs system is quite 
thoroughly entrenched in many fields, and old 
units die hard, if at all, as history has shown. 

For completeness, an mks gravitational system 
could also be described in which the kilogram 
would be the unit of force. Such a system would 
be used in the same manner as the metric 
gravitational system with conversion to absolute 
units for calculations. An mks gravitational sys- 
tem has not been listed in Table I, however, 
because the term mks, like cgs, is used to refer 
only to an absolute system.9 


9 P. Moon and D. E. Spencer, Am. J. Phys. 16, 25 (1948). 


LOREN BULLOCK 


U. S. CUSTOMARY UNITS 


For many years the United States based its 
weights and measures upon the British standards 
in London. In the early part of the 19th Century, 
however, the volume measures were standardized 
in Great Britain, but the United States did not 
follow suit. This accounts for significant differ- 
ences in the present day volume units between 
the two countries. 

The basis of the volume measures was the 
pint,10 which was early legalized in England for 
wine measure. It was that volume which would 
hold a pound of wine. Later other pints came into 
use for other commodities, including dry ones 
such as grain and corn. Each of them was the 
volume which would hold a pound of that particular 
commodity, and in the United States today 
two of these pints are still used: one of 33.60 cubic 
in. for dry measure and one of 28.875 cubic in. for 
liquid measure. A U. S. bushel is defined as 64 dry 
pints, and a U. S. gallon is defined as 8 liquid 
pints. 

In Great Britain, however, the reforms did 
away with the various pints in use and defined 
the British Imperial Gallon as the volume of 10 
lb of water at standard conditions of temperature 
and pressure. The British pint was then defined 
as 1/8 of the imperial gallon, or approximately 
34.7 cubic in. The British bushel was defined as 8 
imperial gallons (i.e., 32 quarts or 64 pints). Thus 
in Britain the same measure is used for both dry 
and liquid commodities, and pints, quarts, 
gallons, and bushels are simple multiples of one 
another.11 
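
The relations among these volume units can be checked with a few lines of arithmetic. The following is only a sketch: the cubic-inch figures are the ones quoted in the text (the British pint is the approximate 34.7 cubic in.), and the names `US_DRY_PINT`, `US_LIQUID_PINT`, `BRITISH_PINT`, and `volumes` are illustrative labels chosen here, not standard terminology.

```python
# Pint sizes in cubic inches, as quoted in the text.
US_DRY_PINT = 33.60
US_LIQUID_PINT = 28.875
BRITISH_PINT = 34.7  # approximate value given in the text


def volumes():
    """Return the larger units, in cubic inches, built from the quoted pints."""
    return {
        "us_bushel": 64 * US_DRY_PINT,          # 64 dry pints
        "us_gallon": 8 * US_LIQUID_PINT,        # 8 liquid pints
        "imperial_gallon": 8 * BRITISH_PINT,    # pint is 1/8 imperial gallon
        "british_bushel": 64 * BRITISH_PINT,    # 8 imperial gallons
    }


if __name__ == "__main__":
    for name, cubic_inches in volumes().items():
        print(f"{name}: {cubic_inches:.1f} cubic in.")
```

Because the British units all derive from a single pint, the British bushel is exactly eight imperial gallons, whereas the U. S. bushel and the U. S. gallon rest on two different pints and bear no such simple ratio to each other.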

The volume measures between the two coun- 
tries are thus significantly different. For length 
and mass, however, there is no difference except 
for the most precise measurements.12 The present 


10 K. G. Irwin, Sci. Monthly 72, 9 (1951). 

11 For a more complete discussion of the differences be- 
tween the units of the two countries, see H. W. Bearce, 
Sci. Monthly 43, 566 (1936). 

12 The yard and the pound are defined in terms of the 
meter and the kilogram by the National Bureau of Stand- 
ards by authority of the Mendenhall order of 1893 which 
states that the definition will be in accord with the Act of 
1866. This act established that 1 yard = 3600/3937 meter 
and 1 pound =1/2.2046 kilogram. Actually a more precise 
comparison between the British Imperial Yard and the 
International Prototype Meter gave a ratio of 36 000 000/ 
39 370 113, and this became the legal equivalent in Great 
Britain in 1898. In addition a comparison of the British 
Imperial Pound and the International Prototype Kilogram 
in 1883 gave a ratio of 1/2.204622341. The Bureau of 







legal definition of the yard in the United States 
leads to the relation 1 inch = 2.5400051 cm, while 
in Great Britain the legal equivalent of the 
meter and the yard leads to the relation 1 inch 
= 2.5399978 cm. In recent years, however, many 
industrial and engineering interests have urged 
the legal adoption of the relation 1 inch=2.54 
cm exactly, but no official action has been taken 
either in Great Britain or in the United States. 
Nevertheless the relation is being used increas- 
ingly in industry. It essentially redefines the 
yard as 1 yard =0.9144 meter exactly. 

In the discussion of British weights so far, the 
British Imperial Pound, or the legally defined 
equivalent in the United States, has been des- 
cribed as the standard. This standard, however, 
is for avoirdupois weights which are to be dis- 
tinguished from troy weights. The troy pound is 
today defined as 5760/7000 of the avoirdupois 
pound. Since 1855, when the present imperial 
pound avoirdupois was legalized, the use of troy 
weights has been restricted to certain fields such 
as pharmacy and in dealing with gems and pre- 
cious metals. Even their use in pharmacy, how- 
ever, is disappearing, and the metric system is 
being used increasingly instead. 


CONVERSION BETWEEN THE SYSTEMS 


The systems of units which have been dis- 
cussed are entirely distinct from one another so 
that a particular physical problem may be calculated 
exclusively in any one of them. To transfer, 
or convert, from one system to another may 
be easily accomplished with conversion factors 
like 2.54 cm/in. or 453.6 g/lb. As an example of a 
solution in each of the five systems, consider the 
following problem which is solved with Newton’s 
second law, F=ma. 

The Problem.—What will be the force required 
to give a 2000 lb car an acceleration of 4.00 
ft/sec²? 

In British Gravitational Units.—The 2000 lb is 
the weight of the car, so its mass is calculated by 
dividing by g, giving 

m = W/g = (2000 lb)/(32.2 ft/sec²) = 62.1 slugs. 

Thus the force will be 

F = ma = (62.1 slugs)(4.00 ft/sec²) = 248 pounds. 


Standards, however, considers that to define the yard 
legally, the 1866 yard to meter ratio must be taken as 
exact. This accounts for the very slight difference between 
the U. S. legal yard and the British legal yard. The Bureau, 
on the other hand, defines the legal pound according to the 
1883 ratio and not by the Act of 1866. 

13 Am. Standards Assoc. B48.1—1933. 


In British Absolute Units.—The 2000 lb is now 
considered as the mass of the car. Thus the force 
will be 


F = ma = (2000 lb)(4.00 ft/sec²) = 8000 poundals. 


Notice that the problem does not specify whether 
the 2000 pounds is mass or weight. It is either one 
depending upon what system of units is used. 

In CGS Units.—Each quantity must now be 
converted to cgs units. The conversion factors 
needed are 454 g/lb and 30.5 cm/ft. The mass is 
then 

m = (2000 lb)(454 g/lb) = 908 000 grams, 

and the acceleration is 

a = (4.00 ft/sec²)(30.5 cm/ft) = 122 cm/sec². 

The force is therefore 

F = ma = (908 000 g)(122 cm/sec²) 
= 111 000 000 dynes. 


In Metric Gravitational Units.—In cgs units, 
the conversion factor, 454 g/lb, was used to convert 
pounds mass to grams mass. For this system 
it converts the 2000 lb weight to grams weight. 
Thus the weight of the car is 908 000 grams, and 
the mass is obtained by dividing by g, giving 

m = W/g = (908 000 g)/(981 cm/sec²) 
= 925 g sec²/cm. 

The mass unit in this system is of course given 
no special name. The acceleration will again be 
122 cm/sec², so the force is 

F = ma = (925 g sec²/cm)(122 cm/sec²) 
= 113 000 grams. 


In MKS Units.—Since the units in this system 
are a simple multiple of the cgs units, the conversion 
from cgs to mks is simplest, using the 
conversion factors 1000 g/kg and 100 cm/m. 
Thus the mass is 

m = (908 000 g)/(1000 g/kg) = 908 kg, 

and the acceleration is 

a = (122 cm/sec²)/(100 cm/m) = 1.22 m/sec², 

so the force is 

F = ma = (908 kg)(1.22 m/sec²) = 1110 newtons. 


These are solutions of the problem in the five 
systems. It is the same information written in five 
different ways. The noticeable thing is that the 
form of Newton’s second law is exactly the same 
whatever system is used. This “invariance” of a 
law of physics with respect to systems of units is 
certainly a desirable principle to maintain, for it 
allows proper emphasis upon physical concepts 
rather than units. It is unfortunate, therefore, 
that Newton’s second law is often written in 
different forms in different systems. For example, 
some texts use Newton’s second law in the form 
F=(W/g)a for gravitational units and in the 
form F=ma for absolute units. Since m= W/g, 
it is obvious that this will give the correct 
answers. One reason for this usage is, perhaps, to 
avoid naming the mass unit in either of the 
gravitational systems, but the concept of mass is 
implicitly used whether it is named or not. 
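
The five solutions worked above can be cross-checked numerically. The sketch below applies F = ma unchanged in every system and varies only the units; the function name `five_system_forces` and the constant names are illustrative, and the conversion factors and values of g are the rounded figures used in the text.

```python
G_FT = 32.2       # acceleration of gravity, ft/sec^2 (value used in the text)
G_CM = 981.0      # acceleration of gravity, cm/sec^2 (value used in the text)
G_PER_LB = 454.0  # grams per pound
CM_PER_FT = 30.5  # centimeters per foot


def five_system_forces(pounds=2000.0, accel_ft=4.00):
    """Apply F = ma in each of the five systems to the car problem."""
    accel_cm = accel_ft * CM_PER_FT   # acceleration in cm/sec^2
    grams = pounds * G_PER_LB         # grams (mass or weight, by system)
    return {
        # Gravitational systems: the given number is a weight, so m = W/g.
        "british_gravitational_lb": (pounds / G_FT) * accel_ft,
        "metric_gravitational_g": (grams / G_CM) * accel_cm,
        # Absolute systems: the given number is a mass.
        "british_absolute_poundal": pounds * accel_ft,
        "cgs_dyne": grams * accel_cm,
        "mks_newton": (grams / 1000.0) * (accel_cm / 100.0),
    }


if __name__ == "__main__":
    for system, force in five_system_forces().items():
        print(f"{system}: {force:.4g}")
```

The same line of arithmetic, mass times acceleration, appears in every entry; only the step that obtains the mass differs between the gravitational and the absolute systems.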

There is another practice in which Newton’s 
second law is written as F=ma/g, with the mass 
in pounds and the force in pounds for British 
units, or the mass and force in grams for metric 
units. Of course the answer is given numerically, 
but physically the concept of mass is being con- 
fused with the concept of weight. Perhaps a 
reason for this practice is that the pound is the 
common unit of force, but the pound has been 
legally defined only as a standard of mass. It is 
evidently not generally realized that the standard 
objects are also used—whether legally or not— 
as standards of weight. 

There is still another practice which attempts 
to make the form of Newton’s second law uniform 
in all systems by writing it F=kma, where k is a 
constant having no physical significance but 
depending on the system of units used. This 
practice, however, obscures the physical meaning 
of the concepts of mass and force. From one 
viewpoint, for example, mass has been described 
as the ratio of force and acceleration. In other 
words, mass is the proportionality constant be- 
tween force and acceleration. An additional 


14 For a discussion of this practice, see Sleator, reference 

15 For a recent discussion of this practice, see C. H. Page, 
Am. J. Phys. 20, 1 (1952), and also F. T. Worrell, Am. J. 
Phys. 20, 456 (1952). 




proportionality constant k which has no phy- 
sical meaning tends to hide the real meaning of 
mass. 

In the considerations of each of these prac- 
tices, the concept of mass plays an important 
part. Perhaps this is the key. How is the concept 
of mass to be introduced to the students? In 
some elementary texts meant for secondary 
school use, the concept of mass is barely used at 
all. Perhaps this is a wise procedure for that level 
of sophistication, but not for the college level. 
Nor is it enough to define mass simply as 
“quantity of matter.” That this can be used as a 
definition of mass is not denied, but it should not 
be the only definition.16 The essential difficulty 
with it is that the term matter is generally 
assumed to be understood intuitively. But for 
classical mechanics, the question, “What is 
matter?” is answered by, “Matter is that which 
has mass and weight.” In other words, an understanding 
of the properties of matter comes only 
with a clear distinction between the concepts of 
mass and weight. The concept of mass, therefore, 
should be carefully introduced and preferably 
from the inertial point of view. 


CONCLUSIONS 


The historical and logical foundations of the 
five systems of units in common use have been 
presented in order to show that each system has 
its own merits and usefulness. In other words, no 
one system is “better” than any other. Moreover, 
it has been emphasized that units from all five 
systems are in common use. At some time in a 
student’s career, therefore, all five systems should 
be thoroughly discussed. The question is, 
“When?” Many introductory texts follow the 
procedure of introducing only two or three of the 
systems, leaving the rest to be discussed in more 
advanced courses. Many advanced texts, how- 
ever, limit themselves to one or two systems and 
assume that a student is already familiar with the 
other systems. As a result, when a student comes 
into contact with various units, he tries to fit 
them into the one or two systems that he knows. 
The systems most often used are the British 
gravitational, the cgs, and the mks systems. A 


16 That a physical concept has more than one definition 
is often surprising to students. At some time in a student’s 
career, this fact and its significance should certainly be 
presented. See reference 6. 







beginning student, however, is usually familiar 
with the gram as a unit of weight, having learned 
to weigh things in grams in high school and 
college chemistry courses. It is no wonder, then, 
that he becomes confused in a physics course 
where only the cgs system is used, with the gram 
as a unit of mass and with weight measured in 
dynes. Even if the theoretical distinction between 
absolute and gravitational systems is not pre- 
sented to a beginning student, he should at least 
understand that a gram is both a unit of mass and 
a unit of force, but in different systems of units. 




In an introductory physics course, therefore, all 
five systems should at least be described, even 
though their complete theoretical basis is post- 
poned for later courses. 

The emphasis upon units and systems of units 
made in this article should not obscure the fact 
that proper emphasis, after all, belongs not upon 
the units, but upon the concepts of physics and 
upon the relations between those concepts. The 
organization of the units into these five parallel 
systems, on the other hand, should do much to 
clarify a student’s thinking. 


Displacement Current, A Useless Concept 


F. W. WARBURTON 
University of Redlands, Redlands, California 
(Received September 15, 1953) 


The Maxwell relation, $\operatorname{curl}\mathbf{B} = (\mu k/c)\,\partial\mathbf{E}/\partial t + 4\pi\mu\mathbf{j}$, is obtained directly by differentiation 
without mention of displacement current. It is also deduced as a consequence of propagation 
with speed c, and the form of the expressions chosen for the fields E and H. These methods of 
examining retarded action of charge on charge have the advantage over emphasis on continuous 
fields that weaknesses of classical theory become evident, and closer correlation with symmetric 


propagation theories becomes possible. 


The situation today, regarding the concept 
of Maxwell’s displacement current, is in 
some respects analogous to that existing thirty 
years ago regarding the fictitious magnetic pole. 
Having listened with discomfort in earlier years 
to explanations that the pole was fictitious, 
although it shared the basis of elementary elec- 
tric theory equally with electric charge, the 
writer resolved with the ardor of youth to do 
what he could about it. Perhaps one should not 
be less assiduous, in this age of postulates and 
invention of particles, in emphasizing the alterna- 
tive and more logical description of experimental 
facts which is possible with the number of inde- 
pendent concepts reduced further by omitting 
the displacement current. One contribution to 
the pole problem was entitled “The magnetic 
pole, a useless concept.”1 Even when employing 
$\Sigma p/r^2 = \Sigma (NiA/l)/r^2$ as a mathematical device 
for computing magnetic fields outside a 
solenoid or magnet, the concept of magnetic 
pole as a source of field is not so useful as the 
concept that p is simply a convenient symbol 
representing the product of current and area 
(magnetic moment) divided by (equivalent) 
length. Probably a smaller percentage of physicists 
now would denounce the displacement current 
as fictitious and obsolete than were willing 
to describe the magnetic pole in these terms 
thirty or more years ago. Yet for a long time, a 
number of physicists have noted the logical 
fallacy of a loose assertion which persists in a 
number of books: that a changing electric field 
produces a magnetic field or that a changing 
magnetic field produces an electric field. They 
have preferred to say that a steadily moving 
charge produces (i.e., is the source of) both a 
changing electric field and a steady magnetic 
field, and that an accelerated charge produces 
both a changing magnetic field and an induced 
(accelerational) electric field. 

1 F. W. Warburton, Am. J. Phys. 2, 1 (1934). Perhaps a 
better phrase in the title of the 1934 paper and in this 1954 
paper would be, “A Worse than Useless Concept”; for the 
endeavor has been to aid the reader in recognizing the 
damage done and the delays made by clinging to loose 
concepts and by failing to accept or to use freely the more 
concrete and specific descriptions of electrical phenomena. 







The electric field E is by definition the ratio 
of the force F on a charge q to the charge 

$$\mathbf{E} = \mathbf{F}/q, \tag{1}$$

produced by another charge q', either as a result 
of relative position only, $\mathbf{E}_s = \int \mathbf{r}\,dq'/r^3$, or due 
to relative position, velocity and acceleration, 
in $\mathbf{E}_i = -\dot{\mathbf{A}}/c$, with $\mathbf{A} = \int \mathbf{v}'\,dq'/cr$. The field E 
thus defined is not something which produces 
another (magnetic) force. It depicts the state 
of affairs in the neighborhood of q' and caused 
by q' in the sense that, when due allowance is 
made for propagation time, the “state of affairs” 
has been changed by the presence of q'. If the 
changing electric field were the cause of magnetic 
force, then the definition of E would be inadequate. 
Tacking such an extraneous meaning 
onto E after its definition in Eq. (1) is not conducive 
to clear thinking on the part of the student. 
On the other hand, showing that a changing 
electric field is accompanied by a magnetic field, 
both related to the presence of a moving charge 
q' at a distance r at a sufficiently previous time 
t−r/c, strains neither the fictitious elastic ether 
nor the student’s credulity. The concept that 
such a charge q' can be the source of both the 
changing electric field and the accompanying 
magnetic field is the less confusing and is sufficiently 
complete. 

The magnetic field vector B is by definition 
the ratio of the transverse magnetic force on a 
current element to the product of the current 
and its length given by 

$$\mathbf{F} = i\,\mathbf{l}\times\mathbf{B}, \tag{2}$$

with B set up by a primary current, $\mathbf{B} = \oint i'\,d\mathbf{l}'\times\mathbf{r}/r^3$, 
and is not something which produces 
another (electric) force. It describes the state of 
affairs as measured by the test current il in the 
region around and “produced by” i'. It is well 
to remember that in the case of the fixed primary 
current and changing mutual inductance 
the induced electric field computed from 
$\oint\mathbf{E}_i\cdot d\mathbf{l} = -(1/c)\,d(\int\mathbf{B}\cdot d\mathbf{S})/dt = -\oint\mathbf{B}\cdot(\mathbf{v}\times d\mathbf{l}/c) = \oint(\mathbf{v}\times\mathbf{B}/c)\cdot d\mathbf{l}$ 
is literally and identically a component 
of the magnetic force per unit charge, 
$\mathbf{E} = \mathbf{v}\times\mathbf{B}/c$, and is not a new force “caused by” 




the magnetic force.2 To insist that $\mathbf{v}\times\mathbf{B}/c$ or 
$d\mathbf{B}/dt$ causes $\mathbf{E}_i$ not only is redundant but it 
introduces serious confusion of ideas. The proposition 
that a changing intermediary field produces 
another kind of field reminds one uncomfortably 
of the “fox which began to drink 
the water which began to quench the fire which 
started to burn the stick which accelerated its 
beating the dog which changed his rate of biting 
the pig until the pig jumped over the stile.” 

The magnetic pole and the displacement current 
were reasonable physical concepts before 
it became known how well the action of charges 
on each other, after the propagation time, replaces 
the “lines of force” of magnets and the 
“strain in the (solid elastic) ether.” Now that 
the ether is gone and we do not visualize D in 
vacuum as a physical displacement of charge 
past the point at which D is computed, retention 
of the name “displacement” for $\mathbf{D} = k\mathbf{E}/4\pi$ with 
the suggestion that it is needed for Maxwell’s 
displacement current adds an element of vagueness 
and mystery which is quite unnecessary. 
Indeed it is misleading. The student is encouraged 
to believe that the vacuous concept of 
current in space where there are no electrons is 
responsible for the magnetic field between the 
condenser plates, whereas the origin x'y'z' of the 
vector $\mathbf{r} = \mathbf{i}(x-x') + \mathbf{j}(y-y') + \mathbf{k}(z-z')$ in the 
equations used for computing both B and 
$\partial\mathbf{E}/\partial t$ locates the source at the charge q' at x'y'z' 
producing both fields at the point xyz. 
Developing the relation between the fields 

$$\operatorname{curl}\mathbf{B} = \operatorname{curl}\mu\mathbf{H} = \frac{\mu k}{c}\frac{\partial\mathbf{E}}{\partial t} \tag{3}$$

by determining the space and time derivatives 
of H and E, respectively, in terms of the motion 
of the charge q' producing both fields provides 
this relation no less rigorously than does the 
classical procedure of adding the fictitious concept 
of the displacement current. The similar 
useful relation of induced electric field in terms 
of space and time derivatives of E and H, for 
example, 

$$\operatorname{curl}\mathbf{E} = -\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}, \tag{4}$$

has not required the concept of a fictitious current 
density of magnetic poles where there were 
no poles passing through the points at which 
$\partial\mathbf{H}/\partial t$ was computed. The more detailed method 
of developing Eq. (3) in terms of the motion of 
q' shows also the conditions under which either 
the time rate of change of induced electric field 
is neglected or curl H is arbitrarily extended, an 
item usually obscured in the emphasis on displacement 
current. The method shows further 
that in the contribution of the induced electric 
field $\mathbf{E}_i$ to the space derivatives of E the leading 
terms in $\operatorname{curl}\mathbf{E}_i$ are of the same order of magnitude 
as certain terms in $\operatorname{div}\mathbf{E}_i$ which are omitted 
in writing $\operatorname{div}\mathbf{E} = 4\pi\rho$ in the usual classical development 
of Maxwell’s equations. 

2 For the case of the moving primary coil, the induced 
electric field in the stationary secondary coil, $\mathbf{E}_i = -d\mathbf{A}/c\,dt$, 
has the same value and is a component of the force per 
unit charge $\mathbf{v}\times\mathbf{B}/c$ as computed by the observer moving 
with the primary, even if one dislikes to use the term induced 
electric field for the case of the moving secondary. 


A DIRECT DEVELOPMENT OF ELECTROMAGNETIC 
EQUATIONS 


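Using the classical expressions for the scalar 
and vector potentials in terms of steadily moving 
charges, 

$$\phi = \sum e'/r, \qquad \mathbf{A} = \sum e'\mathbf{v}'/cr, \tag{5a}$$

one finds at once the corresponding expressions 
for the electric and magnetic fields, E and H, 

$$\mathbf{E} = \mathbf{E}_s + \mathbf{E}_i = -\nabla\phi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = \sum\left[\frac{e'\mathbf{r}}{r^3} - \frac{e'\mathbf{f}'}{c^2 r} - \frac{e'\mathbf{v}'(\mathbf{v}'\cdot\mathbf{r})}{c^2 r^3}\right], \tag{5b}$$

$$\mathbf{H} = \operatorname{curl}\mathbf{A} = \sum\frac{e'\,\mathbf{v}'\times\mathbf{r}}{c r^3}, \tag{5c}$$

since the acceleration of e' is f' and $dr/dt = -(\mathbf{v}'\cdot\mathbf{r})/r$. 
Equations (5) are only one set of 
many which satisfy the experimental results for 
closed circuits, and the lack of reciprocity in the 
classical Lorentz magnetic force using Eq. (5c) 
in $\mathbf{F} = (e\mathbf{v}/c)\times\mathbf{H}$ indicates that the equations in 
this set are incomplete, i.e., H does not represent 
all the force on the moving charge e(v/c). 
Equations (5b) and (5c), of course, satisfy 
Eq. (4) for the induced electromotance (emf) in 
a closed circuit, as may be seen directly when 
one takes the space derivatives of Eq. (5b) and 
the time derivative of Eq. (5c), 

$$\operatorname{div}\mathbf{E} = \sum\left[0 + \frac{e'(\mathbf{f}'\cdot\mathbf{r})}{c^2 r^3} + \frac{3e'(\mathbf{v}'\cdot\mathbf{r})^2}{c^2 r^5} - \frac{e'v'^2}{c^2 r^3}\right], \tag{6a}$$

$$\operatorname{curl}\mathbf{E} = \sum\left[0 - \frac{e'(\mathbf{f}'\times\mathbf{r})}{c^2 r^3} - \frac{3e'(\mathbf{v}'\times\mathbf{r})(\mathbf{v}'\cdot\mathbf{r})}{c^2 r^5}\right], \tag{6b}$$

$$-\frac{1}{c}\frac{\partial\mathbf{H}}{\partial t} = \sum\left[-\frac{e'(\mathbf{f}'\times\mathbf{r})}{c^2 r^3} - \frac{3e'(\mathbf{v}'\times\mathbf{r})(\mathbf{v}'\cdot\mathbf{r})}{c^2 r^5}\right]. \tag{6c}$$

By taking also the space derivatives of Eq. 
(5c) and the time derivative of Eq. (5b), 

$$\operatorname{div}\mathbf{H} = 0, \tag{7a}$$

$$\operatorname{curl}\mathbf{H} = \sum\left[\frac{3e'\mathbf{r}(\mathbf{v}'\cdot\mathbf{r})}{c r^5} - \frac{e'\mathbf{v}'}{c r^3}\right], \tag{7b}$$

$$\frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} = \sum\left[\frac{3e'\mathbf{r}(\mathbf{v}'\cdot\mathbf{r})}{c r^5} - \frac{e'\mathbf{v}'}{c r^3}\right] + \sum\left[\frac{e'\mathbf{v}'v'^2}{c^3 r^3} - \frac{3e'\mathbf{v}'(\mathbf{v}'\cdot\mathbf{r})^2}{c^3 r^5} - \frac{e'\mathbf{v}'(\mathbf{f}'\cdot\mathbf{r})}{c^3 r^3} - \frac{2e'\mathbf{f}'(\mathbf{v}'\cdot\mathbf{r})}{c^3 r^3} - \frac{e'(d\mathbf{f}'/dt)}{c^3 r}\right], \tag{7c}$$
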

we see that when one neglects the terms in $1/c^3$ 
which arise from the induced electric field $\mathbf{E}_i$, 
Eq. (7bc) is equal to Eq. (3) in free space and 
is the mathematical result of the form of the 
equations chosen to describe the fields, $\mathbf{E}_s$ and H. 
If one neglects terms in $1/c^3$, or (preferably) 
assumes that E in Eq. (7bc) includes induced 
electric field and hence that curl H in Eq. (7b) 
is to be extended to include terms in $1/c^3$, the 
propagation equation in E in the space between 
charges is obtained immediately by taking the 
curl of Eq. (6bc) and the time derivative of 
Eq. (7bc) (extended preferably to $c^{-3}$ terms), 

$$\operatorname{curl}^2\mathbf{E} = \nabla\operatorname{div}\mathbf{E} - \nabla^2\mathbf{E} = -\frac{1}{c}\frac{\partial}{\partial t}\operatorname{curl}\mathbf{H} = -\frac{1}{c^2}\frac{\partial^2\mathbf{E}}{\partial t^2}. \tag{8}$$

This becomes the wave equation in E provided 
it can be shown that the gradient of Eq. (6a) is 
zero, 

$$\nabla\operatorname{div}\mathbf{E} = 0. \tag{9}$$


An easy way to guarantee the condition of Eq. 
(9) is to assume that $\operatorname{div}\mathbf{E} = 0$ in Eq. (6a), the 
procedure adopted in Maxwell’s theory for electric 
waves in free space. The introduction of 
terms, resulting from this assumption, which 
compensate for the contribution of induced 
electric field to div E is discussed in more detail 
in connection with Hertz’s solution below. 
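
The closed form for curl H in Eq. (7b) can be spot-checked numerically for a single charge. The sketch below is only an illustration under stated assumptions: one charge e' sits at the origin, its velocity v' is held fixed at the instant considered (no retardation), units are chosen so that e' = c = 1, and the curl of H from Eq. (5c) is taken by central differences with respect to the field point. All function names are hypothetical helpers written for this example.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def h_field(r, v, e=1.0, c=1.0):
    """H of Eq. (5c) for one charge at the origin: e v x r / (c r^3)."""
    rmag = math.sqrt(dot(r, r))
    return tuple(e * comp / (c * rmag ** 3) for comp in cross(v, r))


def curl_h_closed_form(r, v, e=1.0, c=1.0):
    """Right side of Eq. (7b): 3 e r (v.r) / (c r^5) - e v / (c r^3)."""
    rmag = math.sqrt(dot(r, r))
    vr = dot(v, r)
    return tuple(3 * e * r[i] * vr / (c * rmag ** 5) - e * v[i] / (c * rmag ** 3)
                 for i in range(3))


def curl_numeric(field, r, h=1e-5):
    """Central-difference curl of a vector field at the point r."""
    def partial(i, j):
        plus, minus = list(r), list(r)
        plus[j] += h
        minus[j] -= h
        return (field(tuple(plus))[i] - field(tuple(minus))[i]) / (2 * h)
    return (partial(2, 1) - partial(1, 2),
            partial(0, 2) - partial(2, 0),
            partial(1, 0) - partial(0, 1))


r = (1.0, 2.0, 0.5)    # field point relative to the charge
v = (0.3, -0.1, 0.2)   # velocity of the charge, in units of c
numeric = curl_numeric(lambda p: h_field(p, v), r)
closed = curl_h_closed_form(r, v)

if __name__ == "__main__":
    print("numeric curl H:", numeric)
    print("Eq. (7b):      ", closed)
```

Agreement of the two results, component by component, supports the reading that Eq. (7b) is a direct mathematical consequence of the form chosen for H, with no appeal to a displacement current.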

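Continuing the development in terms of stationary 
and moving charges, we find in the usual 
way the values of div E in a body of charge 
density ρ and curl H in a wire carrying moving 
charges with mean current density j'. If n' is 
the number of conduction charges e' per unit 
length in any cross section ΔS, $n'e'v'/c = \Delta i'$. 
Then $H = \Delta i'\int dz\,\sin\theta/(r^2+z^2)$; and 

$$\operatorname{curl}\mathbf{H} = \lim_{\Delta S\to 0}\frac{\oint\mathbf{H}\cdot d\mathbf{l}}{\Delta S} = \lim_{\Delta S\to 0}\frac{\Delta i'}{\Delta S}\oint\!\int_{-\infty}^{\infty}\frac{r\,dz\,dl}{(r^2+z^2)^{3/2}} = \frac{4\pi\,\Delta i'}{\Delta S} = 4\pi j'. \tag{10}$$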

Extensions to include the permeability μ and the 
dielectric constant k are made as usual by restricting 
e' and i' to conduction charges and currents, 
so that the net (conduction and polarization) 
charge becomes $e' + e_p' = e'/k$, and the total 
(conduction and Amperian) current becomes 
$i' + i_m' = \mu i'$. Thus, Eq. (7bc) and Eq. (10) are 
represented by Eq. (3) as extended to include 
$\mathbf{E}_i$ and terms in $1/c^3$ in both curl B and $\partial\mathbf{E}/\partial t$. 
Also Eq. (4) must likewise now include $\mathbf{E}_i$, while 

$$\operatorname{div}\mathbf{E} = 4\pi\rho/k, \tag{11}$$

provided one makes the remaining terms (in 
$\operatorname{div}\mathbf{E}_i$) vanish in Eq. (6a). Equations (3), (4), 
and (11) as thus extended and restricted then 
yield the propagation equation in E in dielectric 
and magnetic media. 
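
The line integral used in Eq. (10) can be verified numerically. The sketch below assumes a long straight wire along the z axis carrying current Δi' (in the text's units) and sums the contributions r dz/(r²+z²)^{3/2} by the midpoint rule over a long but finite length; the function names, the half-length of 1000 r, and the step count are illustrative choices, so the result is approximate.

```python
import math


def field_of_long_wire(r, delta_i=1.0, half_length=1000.0, steps=200_000):
    """Approximate H at distance r from a long straight wire.

    Sums delta_i * r dz / (r^2 + z^2)^(3/2) over -L < z < L with L = half_length * r.
    """
    big_l = half_length * r
    dz = 2.0 * big_l / steps
    total = 0.0
    for n in range(steps):
        z = -big_l + (n + 0.5) * dz          # midpoint of the n-th slice
        total += r * dz / (r * r + z * z) ** 1.5
    return delta_i * total


def circulation(r, delta_i=1.0):
    """Line integral of H around a circle of radius r centered on the wire."""
    return 2.0 * math.pi * r * field_of_long_wire(r, delta_i)


if __name__ == "__main__":
    for radius in (0.5, 1.0, 2.0):
        print(radius, circulation(radius), 4.0 * math.pi)
```

The circulation comes out independent of the radius and close to 4πΔi', which on division by ΔS gives the 4πj' of Eq. (10).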

The somewhat unsatisfactory logic of the 
arbitrary extension of $\operatorname{curl}\mathbf{H} = \partial\mathbf{E}/c\,\partial t$ to include 
induced electric field $\mathbf{E}_i$ may be avoided by a 
method used in the next paragraphs in dealing 
with another case of redundancy of concepts. 
The extension of $\operatorname{div}\mathbf{E}_s = 4\pi\rho/k$ to include the 
radiation part of $\mathbf{E}_i$ is retained however to provide 
a restriction justified on a different basis 
while leaving the equations in the Maxwell form. 

The concept that both the fields E and H, and 
the potentials φ and A are propagated with speed 
$v = c/(\mu k)^{1/2}$ is redundant and unnecessary. Propagation 
of potentials alone is sufficient and 
represents more directly the transfer of energy 
by radiation. What meaning can be associated 
with the wave propagation of E save the ability 
to give a charge e an acceleration when the 
“E-wave” reaches e? Field E as a “moving force” 
on e has no meaning until it reaches e. The 
ability to exert force on e when arriving at e 
after the propagation time is an attribute of the 
propagated potentials, hence it is unnecessary 
to insist on any accompanying E wave. Economy 
and clarity of concept are thus provided by treat- 
ing the propagation of potentials only, and find- 
ing E and H at the desired place and time by 
taking the appropriate space and time rates of 
change of the potentials. Both φ and A may be 
regarded as expressing the aggregate energy 
transferable to a statcoulomb of electrons e, 
whether the energy be transferred in a manner 
best described by quanta or by wave motion or 
both. There is very little exact direct experi- 
mental evidence for the magnetic field accom- 
panying the electric field as such, and much 
more direct evidence for propagation of elec- 
trical energy. Assuming the equations of prop- 
agation of potentials 


Vo — (1/v*)d°o/d? =C 
V?A — (1/v")0?A/d? =K, 


(12) 
(13) 


determining C = −4πρ/k and K = −4πμj from
the steady-state conditions, and deriving the
relations between the potentials and between the
fields, Eqs. (14) and (15) below, provide the
set of Maxwell equations quite as rigorously
and with no more assumptions than does the
conventional procedure of deriving Eqs. (12),
(13). The experimental evidence more directly
supports the assumptions made, while the "displacement
current" recedes into the background.
Equation (3) becomes simply a derived relation,
Eq. (15), between the fields. This procedure
uses E of Eq. (5b) in Eq. (11). Then comparison
with Eq. (12),

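$$-\operatorname{div}\mathbf{E} = \nabla^2\phi + \frac{1}{c}\frac{\partial}{\partial t}\operatorname{div}\mathbf{A} = -\frac{4\pi\rho}{k} = \nabla^2\phi - \frac{1}{v^2}\frac{\partial^2\phi}{\partial t^2},$$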


results in the condition relating the potentials, 


$$\operatorname{div}\mathbf{A} = -\frac{c}{v^2}\frac{\partial\phi}{\partial t} = -\frac{\mu k}{c}\frac{\partial\phi}{\partial t}, \tag{14}$$


the last step, identifying v = c/(μk)^(1/2), obtained
from the steady-state boundary condition by
taking space and time derivatives of Eqs. (5a)
including k and μ explicitly. Next combining
Eq. (5b) with Eq. (13) results in
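$$\nabla^2\mathbf{A} = \nabla\operatorname{div}\mathbf{A} - \operatorname{curl}^2\mathbf{A} = \frac{\mu k}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} - 4\pi\mu\mathbf{j} = -\frac{\mu k}{c}\nabla\frac{\partial\phi}{\partial t} - \frac{\mu k}{c}\frac{\partial\mathbf{E}}{\partial t} - 4\pi\mu\mathbf{j},$$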


and adding the condition, Eq. (14), yields the 
condition, [Eq. (3) ], 


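$$\operatorname{curl}^2\mathbf{A} = \operatorname{curl}\mu\mathbf{H} = \frac{\mu k}{c}\frac{\partial\mathbf{E}}{\partial t} + 4\pi\mu\mathbf{j} \tag{15}$$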


between the fields, without any need for calling
k(∂E/∂t)/4πc a displacement current density.
The development of Eqs. (14) and (15) from
Eqs. (12) and (13) adds the single concept of
propagation assumed in Eqs. (12) and (13),
to replace the two usual assumptions, curl μH =
(μk/c)∂E/∂t, Eq. (3-15), considered either as a
physical displacement current or as a mathematical
term added to satisfy the continuity
equation, and div A = −(μk/c)∂φ/∂t, Eq. (14),
the additional assumed relation between the
potentials. Equation (14) now appears as merely
a relation between the potentials which is a
consequence of the form these potentials take,
whether propagated or stationary. Equation
(3-15) now appears likewise as simply a relation
between the fields which is unchanged by the
equations of propagation. Field E includes the
electrostatic field E_s and the part of the induced
electric field E_i effective in radiation. It permits
the distinction between Eq. (15) for the radiation
part of E_i and Eq. (7bc) for the quasi-stationary
E_i.

It is apparent that the use of div E = 4πρ/k,
neglecting terms in 1/c² in Eq. (6a), which are
of the same order of magnitude as those needed
and used in Eqs. (6bc) and (5b), is a restriction
on the propagated field E which is not present
in the induced electric field expressed as negative
time rate of change of vector potential A = e′v′/cr.
Furthermore, it is clear that it is this restriction
and the neglect of the terms f′·r/c², v′·r/c, which
insure that the components of velocity v′ and
acceleration f′ parallel to the direction of propagation
are ineffective. To demonstrate the
point one need only show that a potential term
not present in the forms of φ_s and A is introduced
by use of div E = 4πρ/k, and that this added
potential term yields directly terms in E which
cancel these terms in E_i. This cancellation of
the terms in f′·r and v′·r is the process by which
the insistence of their neglect in writing div E
= 4πρ/k is satisfied. For example the well-known
Hertz solution³ of Eqs. (12) and (13) for an
oscillating electric dipole is obtained by use of
spherical coordinates with axes oriented so that
the vector potential of a moving charge is A_z
and thus depends only on r. A solution of Eq.
(13) is then found in the form rA_z = f(t − r/c), a
function of r and t only. When boundary conditions
are added the solution becomes A_z = ṗ(t − r/c)/cr,
where p, the electric moment of the dipole,
is a "retarded" function of (t − r/c), while r is
not considered a function of t. Substituting this
in Eq. (14) (which is the direct result of the
neglect of terms in div E_i by setting div E = 4πρ)
and then integrating for φ, one obtains φ = φ₁ + φ₂
with a new scalar potential φ₂ = ṗ(t − r/c) cos θ/cr
= A_z cos θ in addition to the extended electrostatic
potential φ₁ = p(t − r/c) cos θ/r². The radial
component of the gradient of this added potential


$$(E_2)_r = -\frac{\partial\phi_2}{\partial r} = \left[\frac{\ddot{p}(t-r/c)\cos\theta}{c^2 r} + \frac{\dot{p}(t-r/c)\cos\theta}{c r^2}\right], \tag{16}$$

compensates the radial component of the induced
electric field,

$$(E_i)_r = -\frac{\ddot{p}(t-r/c)\cos\theta}{c^2 r}, \tag{17}$$

i.e., the f′·r term. Also, div E₂ = −div(E₁ + E_i)
cancels both the v′·r term, 2ṗ cos θ/cr³, and the
f′·r term, −p̈ cos θ/c²r², in div(E₁ + E_i) when they
are expressed as functions of t − r/c.
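The cancellation described by Eqs. (16) and (17) can be checked numerically. The sketch below is only an illustration: it assumes a harmonic dipole moment p(t) = P0 sin(ωt) and arbitrary values for c, ω, r, t, and θ, none of which come from the text. It differentiates φ₂ = ṗ(t − r/c) cos θ/cr numerically, compares the result with the two terms of Eq. (16), and confirms that adding (E_i)_r of Eq. (17) removes the acceleration term and leaves only the term in ṗ.

```python
import math

C = 3.0e10      # speed of propagation in cm/sec (assumed value)
OMEGA = 2.0e9   # angular frequency of the dipole in rad/sec (assumed)
P0 = 1.0        # dipole amplitude in arbitrary units (assumed)

def p_dot(u):
    """First time derivative of the assumed moment p(u) = P0 sin(OMEGA u)."""
    return P0 * OMEGA * math.cos(OMEGA * u)

def p_ddot(u):
    """Second time derivative of the assumed moment."""
    return -P0 * OMEGA ** 2 * math.sin(OMEGA * u)

def phi2(r, t, theta):
    """Added scalar potential phi_2 = p_dot(t - r/c) cos(theta) / (c r)."""
    return p_dot(t - r / C) * math.cos(theta) / (C * r)

def e2_radial_numeric(r, t, theta, h=1.0e-4):
    """-d(phi_2)/dr estimated by a central difference of step h (cm)."""
    return -(phi2(r + h, t, theta) - phi2(r - h, t, theta)) / (2.0 * h)

def e2_radial_eq16(r, t, theta):
    """Right-hand side of Eq. (16)."""
    u = t - r / C
    return (p_ddot(u) * math.cos(theta) / (C ** 2 * r)
            + p_dot(u) * math.cos(theta) / (C * r ** 2))

def ei_radial_eq17(r, t, theta):
    """Right-hand side of Eq. (17)."""
    return -p_ddot(t - r / C) * math.cos(theta) / (C ** 2 * r)

r, t, theta = 10.0, 1.0e-9, 0.3   # assumed field point and time
print(e2_radial_numeric(r, t, theta), e2_radial_eq16(r, t, theta))
print(e2_radial_eq16(r, t, theta) + ei_radial_eq17(r, t, theta))
```

The two numbers on the first printed line should agree to within the accuracy of the finite difference, and the second line is the residual radial field, which contains ṗ only.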

Terms in v′²/c² are neglected in the classical
magnetic force of one moving charge e′(v′)
acting on another charge e(v). It is interesting
that this factor appears in div E_i, in use of
dr/dt = −v′·r/r, only to be neglected again in
the classical dipole radiation.

As we have seen, the transverse nature of E
in the radiation zone is assured formally by retaining
div E = 4πρ without a physical reason
therefor. The physical reason is supplied by
ballistic theory which accounts for v′·r = 0 and
f′·r = 0 due to mix-up-in-phase,⁴ the propagated
particles (or influence) separating from the
electron-proton dipole source with speed c, having
velocity c + v_r′ ahead of the electron e′ and
velocity c − v_r′ behind it and not arriving at e
with simple harmonic impulses. Thus, we see
that closer study of details, including now ballistic
theory which accounts for electromagnetic
propagation,⁵ can account also for the great
success which the classical approximation, leading
to Maxwell's theory, has experienced.

3 Abraham-Becker, Classical Electricity and Magnetism
(Blackie and Son, London, 1932), p. 223.


COMPARISON WITH CLASSICAL THEORY 


Let us see now how closely Maxwell's formulation
of his theory in terms of displacement current
corresponds to the steps (5abc), (6abc),
(7abc), (8), (9), (10), (11) in terms of motion
of charges. Maxwell⁶ stressed electric polarization
as consisting of electric charge Q displaced
across the dielectric within a condenser, in response
to an equal conduction charge transferred
by the applied electromotance along the
external wire, such that the surface density of
this displacement charge Q is D = kE/4π. He
did not clearly distinguish this from the physical
polarization charge density P = D − E/4π
= (k − 1)E/4π due to charges displaced within
the atoms and which is equivalent to net charge
per unit area displaced across the dielectric, nor
did he note that P approaches D only for the
conductor (k = ∞, E = 0), and that P approaches
zero for vacuum. In Sec. 111, Maxwell asserted
that "according to [his] theory all charge is the
residual effect of the polarization of the dielectric."
It is evident, as Jeans recognized,⁷ that
Maxwell's displacement of charge cannot be
ordinary charge and that Maxwell's concept
becomes unnecessary for the development of the
theory. Yet Jeans and others use Maxwell's
"total current," ∫[j′ + (k/4πc)(∂E/∂t)] dS.

The development of classical electromagnetic
theory, as exemplified by Maxwell and Jeans,
rests its case on the use of the Lorentz force,
(ev/c) × (e′v′ × r/cr³); and success of classical theory
despite its drawbacks is usually considered
to justify use of this nonreciprocal force formula
for moving charges. This formula was developed
for the action on an electron by a magnet, now
believed to consist of closed Amperian currents
due to electrons rotating and spinning in closed
paths, and has been verified for the action of a
closed primary current on a moving electron,
but not for an unclosed primary current.⁸ The
relations ∮H·dl = 4πi′ and curl H = 4πj′ are valid
with current traversing a closed path and with
current density j′ uniform over a small volume
not near the ends of a uniform current. As these
formulas have not been verified for unclosed
primary currents, it is to be expected that they
should yield div curl H = 0 = 4π div j′ as they do,
characteristic of a current following a closed
path. Circuits ending on condenser plates are
thus not completely described by these classical
relations. Instead of taking this result as an
additional indication of the inadequacy of H
and the incompleteness of ev × H/c for H produced
by charges moving in unclosed paths,
Maxwell and his followers chose to emphasize
the ethereal displacement current. The time rate
of change of an electric field between condenser
plates was assigned the concept of a current and
added to the conduction current to form Maxwell's
"total current," which satisfies the equation
of continuity and div curl H = div(4πj′
+ kĖ/c) = 0, and results in Eq. (3) which corresponds
to Eqs. (7bc), (10). The use of the nonreciprocal
force (ev/c) × (e′v′ × r/cr³), which becomes
equal to the reciprocal force for a closed
circuit in e′v′ acting on a moving charge ev, was
formally justified by the belief that "all circuits
are closed." But the question of the contribution
of each electron e′v′ in the primary circuit to the
magnetic force on the electron ev was left
unanswered. Only the expression for the total
force on ev due to the whole closed primary
circuit was justified.⁸ And scientists, ever ready
to embrace a new concept, evidently failed to
remark that Eq. (3) expresses the relation between
H and the rate of increase of electrostatic
field of a moving charge, even if H should be
inadequate or have no meaning at all.

4 F. W. Warburton, Phys. Rev. 69, 40 (1946).

5 M. J. S. Dewar, Phil. Mag. 38, 488 (1947).

6 J. C. Maxwell, Treatise on Electricity and Magnetism
(Clarendon Press, Oxford, 1940), third edition, Vol. 1,
Secs. 60, 75, 76, 111; Vol. 2, Sec. 608.

7 J. H. Jeans, Electricity and Magnetism (Cambridge
University Press, Cambridge, 1923), pp. 155, 511.

8 One cannot, of course, accept as proof of the validity
of the Lorentz force any case in which any of the reciprocal
formulas of the Ampere-Weber-Riemann type can also be
used, for the Lorentz force is incompatible with these for
unclosed circuits. Very definite proof of the superiority
of the reciprocal forces is shown in Dunton's circuit
breaker [Nature 140, 245 (1937)] and in Cleveland's
rectangle [Phil. Mag. 21, 416 (1936)].

We should note that the classical expression
div(4πj′ + kĖ/c) = 0 developed in the usual way
for the electrostatic field in a condenser in terms
of current density j′ omits variations in the induced
electric field E_i, just as does neglect of
the c⁻² terms in Eq. (7c). The extension of Eq.
(3) to include the induced electric field in E is
then a pure assumption that the electrostatic
field is inadequate to satisfy Eq. (3), and that
the addition of the induced electric field is
needed, this despite the fact that Eq. (3) was
set up in terms of electrostatic field. And it is
rarely made clear that Eq. (3) does not represent
an accurate relation between the unextended
B = curl A of quasi-stationary states, Eq.
(7b), and the expanded E including E_i, Eq. (7c).

Equations (11) and (9) substituted in Eq.
(8), yield the wave equation in E. An additional
assumption, Eq. (14), is needed for deriving the
equations for propagation of the potentials φ and
A. These steps represent the classical procedure
whether (k/4πc)(∂E/∂t) is considered a physical
displacement current or whether it is simply a
term added to satisfy the continuity equation
and to build up a self-consistent set of equations
for the extended E. The former case suffers a
bit more unsatisfactory logic than developing
Eq. (8) from Eq. (6bc) and Eq. (7bc) which
contains 1/c² terms; while the latter case escapes
this weak development only indirectly by
assuming that the field E satisfying the Maxwell
equations is the correct one, and hence that the
quasi-stationary induced electric field, E_i = −∂A/c∂t,
is incomplete or includes too much. The
weak link then appears, in most presentations,
in that the reader is not forewarned that the r
component of E_i (Eq. 17) is not to be used even
in low frequency ac and transient currents since
it is compensated by the r component of E₂
(Eq. 16). As alternating and transient currents
are made up of oscillating dipoles, literal acceptance
of the extended E of the Maxwell
equations as the only correct E requires the
cancellation of the accelerational term (E_i)_r
in Eqs. (16), (17), even near the source where
r < ct. On the other hand the fact that quasi-stationary
states are ordinarily satisfactorily
described in terms of ∂A/∂t without mention of
the cancellation of (E_i)_r by ∂φ₂/∂r indicates
that E_i is not necessarily thus restricted in dealing
with alternating and oscillating primary
currents.

In each of the two forms of classical theory
mentioned above emphasis is placed on the
formal extension of div E to include E_i, without
apparent recognition that this supports the prediction
of the reciprocal theories⁴,⁵ that the
r component of the induced electric field E_i used
in quasi-stationary states need not be effective
in the radiation zone. In other words the (partial)
success of the Maxwell theory, which includes
the nonreciprocal Lorentz force and the
arbitrary extension of E used in Eq. (3) and
Eq. (11), should not be construed as evidence
against the nonlocalized reciprocal ballistic theories
which can describe radiation also by assumption
of propagation of potentials.


SUMMARY 


The concept of a displacement current where 
there is no flow of charge, although it appeared 
useful at the time it was invented, was soon 
found unnecessary. The approximate relation 
curl B/μ = kĖ/c is a consequence of the form of
the equations representing H and E. This rela- 
tion is assumed exact when H and E are ex- 
tended to include propagation in the classical 
development of the retarded fields and poten- 
tials. This relation is found unchanged but be- 
comes of secondary importance when assuming 
propagation and deriving the relations between 
the potentials and between the fields. In both 
methods the theory gains noticeably in con- 
sistency when such an extraneous concept as 
displacement current is eliminated from this 
relation of B and dE/dt. The concept of dis- 
placement current is a handicap because it 
takes attention off the retarded action of charge 
on charge and the conditions which guarantee 
the transverse electric field in the radiation zone. 
Details suggestive of ballistic emission, which 
are ordinarily ignored in classical theory, become 
evident when emphasis is taken off the smoothed- 
over field theory and more attention is placed 
on the action of charge e’ on charge e (after the 
propagation time) in terms of which the poten- 
tials and the fields are defined. 





Experiments on the Laws of Light Absorption 


OTTO BLUH AND HOWARD KO
University of British Columbia, Vancouver, Canada 


(Received September 22, 1953) 


Attention is called to the importance of an experimental study of the laws of light absorption
in the intermediate and advanced physics laboratory. A simple arrangement is described for
measuring the absorption coefficients of solids and liquids, and suggestions are made for
various experiments with commercial apparatus.


THE knowledge and understanding of the
laws of light absorption is obviously important
not only for the physicist, but also for
the scientist at large, the student of medicine, 
radiology, and so on. The experiments described 
below have been used for several years in a 
second- and third-year laboratory for premedical 
and biology students, and also in a fourth-year 
biophysics laboratory. They allow a number of 
variations and applications, which offer oppor- 
tunity for doing minor pieces of research that 
will lead to definite results. 

The experimental study of the laws of light 
absorption provides a way of getting familiar 
with the important exponential function, useful 
graphical representations (particularly the use 
of semilogarithmic paper) and various concepts 
and terms such as transmittancy, absorption 
coefficient, half-value thickness, etc. Moreover, 
the ideas pertaining to the absorption of visible 
light are important for the understanding of 
colorimetry, spectrophotometry, and photog- 
raphy, and can easily be extended to cover the 
absorption of x-rays and γ rays, which are so
significant in modern radiology. The mathe- 
matical relationships involved are formally analo- 
gous and can be easily applied to the discussion 
of radioactive decay. 


METHOD AND NOTATION 
The following terms, definitions, and equations 
will be used: 


P₀, intensity of light incident on a substance
(expressed in arbitrary units);

P, intensity of light transmitted through the
substance (in same units as P₀);

τ = P/P₀, transmittancy of the substance (numeric).   (1)

For two substances of individual transmittancies
τ₁ and τ₂ the transmitted intensity is

$$P_{1,2} = \tau_1\tau_2 P_0. \tag{2}$$

If n absorbing media, each of transmittancy τ,
are placed in the light path, the final intensity
Pₙ is diminished to a value

$$P_n = \tau^n P_0. \tag{2a}$$

A = (P₀ − P)/P₀ = 1 − τ, absorptivity;

O = 1/τ, opacity;

D = log₁₀ O, optical density.


The intensity Pₓ of the light beam after
passing through an absorbing plate of thickness
x cm can be related to the initial intensity P₀
by the expression (Bouguer-Lambert):

$$P_x = P_0 e^{-ax}, \tag{3}$$

where a is the linear absorption coefficient, and
e = 2.718 is the base of the natural logarithms;
the negative sign indicates that Pₓ decreases
with increasing thickness x.

Taking logarithms we obtain

$$-ax\log_{10}e = -0.4343\,ax = \log_{10}(P_x/P_0) = -\log_{10}(P_0/P_x). \tag{4}$$


The half-value thickness x₁/₂, for which the transmitted
intensity is one-half the incident intensity,
Pₓ = ½P₀, may be found in the following
manner:

$$0.4343\,a\,x_{1/2} = \log_{10}2 = 0.3010,$$

so that

$$x_{1/2} = 0.3010/0.4343a = 0.693/a. \tag{5}$$
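The definitions and Eqs. (1), (3), and (5) above lend themselves to a short numerical sketch. The Python function names and the sample value a = 0.50 per cm are illustrative assumptions and are not taken from the article.

```python
import math

def transmittancy(p, p0):
    """Eq. (1): ratio of transmitted to incident intensity."""
    return p / p0

def optical_density(tau):
    """D = log10(O), where the opacity is O = 1/tau."""
    return math.log10(1.0 / tau)

def transmitted_intensity(p0, a, x):
    """Eq. (3), Bouguer-Lambert law; a in 1/cm and x in cm."""
    return p0 * math.exp(-a * x)

def half_value_thickness(a):
    """Eq. (5): thickness that halves the intensity, ln(2)/a."""
    return math.log(2.0) / a

# Assumed example: an absorber with a = 0.50 per cm.
a = 0.50
x_half = half_value_thickness(a)
print("half-value thickness (cm):", round(x_half, 3))
print("intensity behind x_half for P0 = 100:",
      round(transmitted_intensity(100.0, a, x_half), 3))

# Optical densities of stacked filters add, since transmittancies multiply.
print(optical_density(0.5 * 0.25), optical_density(0.5) + optical_density(0.25))
```

The last line illustrates why optical density is convenient for filter combinations: the product rule of Eq. (2) becomes a simple sum.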


For solutions one finds that the linear absorption
coefficient is, for a limited range of concentration,
proportional to the concentration c:

$$a = \epsilon c, \tag{6}$$

where ε is the molar extinction coefficient, a
constant characteristic for the particular solution;
c is given in mole/liter. The equation

$$P_x = P_0 e^{-\epsilon c x} \tag{3a}$$

is referred to as Beer's law.


EXPERIMENTAL ARRANGEMENT AND 
PROCEDURE 


The apparatus used for the basic experiments 
(Fig. 1) consists of a light source L (40- or 
60-watt frosted glass lamp), which can be ad- 
justed in height above a stage S on which the 
absorbing substances are placed. A gas-filled 
photocell PH (RCA 925) is built into a light- 
tight box b. Light is admitted through the ad- 
justable slit aperture SZ. The photocell is in 
series with a B battery B and a high resistance R
(about 10⁷ ohms); the photocurrent is measured
by observing the potential difference across the
resistance R with the help of a vacuum-tube
voltmeter VTVM. The deflections on the voltmeter
scale can be taken as linearly proportional
to the light intensities P falling onto the
photocell.

Experiment 1 


By adjusting the height of the lamp and the 
slit width one produces a convenient (e.g., maxi- 
mum) deflection on the voltmeter scale. Filters 
of neutral grey glass of various transmittancies 
are placed over the stage window and the corre- 
sponding voltmeter deflections read. The trans- 
mittancies of the single grey glass filters are 
calculated from Eq. (1). Two or more filters used 
simultaneously lead to a light transmittancy 
according to Eqs. (2) or (2a). 


Experiment 2 


Transparent plates (glass or plastics) or 
colored cellophane sheets of the same thickness 
are stacked one by one on the stage S and read- 
ings of the P values taken. The full scale deflec- 
tion Po should be frequently checked. In this 
experiment some light losses due to reflection are 
observed with a large number of plates. The 


evaluation of the readings makes use of Eqs. 
(3) and (4). 


FIG. 1. Photoelectric densitometer for the
measurement of absorption coefficients.


Experiment 3 


The disturbing factor of light reflection can
be avoided entirely by using a liquid as absorber
(see Fig. 1). One places on the stage S a
crystallizing dish of diameter about 8 cm, and
fills it to a height of about 2 mm with a concentrated
colored solution such as methylene
blue in water in order to obtain initially a plane
liquid surface. With the help of a burette having
a long outlet, known small volumes ΔV of the
same solution are added to the liquid, and for
any new volume and corresponding thickness,
the transmittancy is observed. If the area A of
the dish is known (the diameter is measured
with calipers), one can calculate the increase in
thickness as Δx = ΔV/A. This procedure allows
the increase of thickness of absorber in very small
steps (provided the opacity of the solution is
sufficient), and for a great number of steps (say
20). As before, Eqs. (3) and (4) are applied to
the resulting measurements.
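The bookkeeping of Experiment 3 can be sketched as follows. The dish diameter of 8 cm and the initial depth of 2 mm are the values suggested above; the burette increment of 1 cm³ and the function names are assumptions made only for this illustration.

```python
import math

def dish_area(diameter_cm):
    """Area A of a circular dish in cm^2."""
    return math.pi * diameter_cm ** 2 / 4.0

def layer_thicknesses(initial_depth_cm, added_volumes_cm3, diameter_cm):
    """Depth of liquid after each burette addition, using dx = dV / A."""
    area = dish_area(diameter_cm)
    depths = [initial_depth_cm]
    for dv in added_volumes_cm3:
        depths.append(depths[-1] + dv / area)
    return depths

# 20 steps of 1 cm^3 each (assumed increment) into an 8-cm dish, 2 mm deep.
depths = layer_thicknesses(0.2, [1.0] * 20, 8.0)
print("step in thickness (cm):", round(depths[1] - depths[0], 4))
print("final depth (cm):", round(depths[-1], 3))
```

With these numbers each addition raises the level by about 0.02 cm, which shows how finely the absorber thickness can be varied.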


Experiment 4 


The same glass dish is filled with solutions of
different molar concentrations and the corresponding
transmittancies measured. The concentrations
may be varied from c to 0.1c. In
this and the preceding experiment (3) with
colored solutions, color filters placed in front of
the photocell can be used to investigate the absorption
spectrum of the solutions.


GRAPHICAL REPRESENTATION AND RESULTS 


(1) The results of the observations of Experiment
1 give an application of Eqs. (2) and (2a).
One compares the calculated transmittancy with
the measured transmittancy of the combination
of filters, and also demonstrates the relation for
the optical densities D₁,₂ = D₁ + D₂, etc.

(2) Experiments 2 and 3 give opportunity for
using two methods of graphical representation.
According to Eq. (3), when Pₓ or Pₓ/P₀ is plotted
against x, one obtains an exponential curve (Fig.
2). From this curve one can easily read off the
half-value thickness x₁/₂, and by using Eq. (5)
calculate the absorption coefficient a = 0.693/x₁/₂.

In order to show that the curve in Fig. 2 is
exponential, one will prefer to use the linear
equation (4), and plot log₁₀Pₓ as ordinate (P₀
being constant) against x as abscissa, where
−0.4343a is equal to the slope of the straight
line. For this graph semilogarithmic paper is
used. Instead of the actual thickness of the
absorbing layers the number of equal plates
(Experiment 2) or the volume of the solution
(Experiment 3) can be plotted as abscissa (Fig.
3); for the calculation of the absorption coefficient
the actual thickness has to be known.

FIG. 2. Light-absorption curve.

FIG. 3. Semilogarithmic plot of light-absorption curve.
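The graphical evaluation on semilogarithmic paper corresponds to fitting a straight line to log₁₀Pₓ against x. The sketch below does this by ordinary least squares; the data are synthetic (generated from an assumed a = 1.5 per cm) and the function names are illustrative only.

```python
import math

def fit_line(xs, ys):
    """Least-squares slope and intercept of y against x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def absorption_coefficient(thicknesses_cm, intensities):
    """Eq. (4): the slope of log10(P) against x equals -0.4343 a."""
    logs = [math.log10(p) for p in intensities]
    slope, _ = fit_line(thicknesses_cm, logs)
    return -slope / math.log10(math.e)

# Synthetic readings for 20 thickness steps (assumed values, no noise).
xs = [0.2 + 0.02 * i for i in range(20)]
ps = [100.0 * math.exp(-1.5 * x) for x in xs]
a = absorption_coefficient(xs, ps)
print("fitted a (1/cm):", round(a, 4))
print("half-value thickness (cm):", round(math.log(2.0) / a, 4))
```

With real readings the points scatter about the line, and reflection losses (Experiment 2) show up as a departure from straightness at large numbers of plates.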

(3) The measurements in Experiment 4 are 
used to plot a log of transmittancy-concentration 
graph, resulting in approximately a straight line 
that can be used as a calibration curve, so that 
an unknown concentration of the same kind of 
solution can be found by interpolation as in the 
standard practice in colorimetry. Applying Eq. 
(6) one calculates the molar extinction coeffi- 
cient which is found to be constant for a limited 
range of concentration. 
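A calibration of this kind might be evaluated as sketched below. The layer depth, the concentrations, and the extinction coefficient used to generate the sample transmittancies are assumed values, and the helper names are hypothetical; only the relations a = εc and Pₓ = P₀e^(−εcx) come from the text.

```python
import math

def molar_extinction(tau, conc_mol_per_l, depth_cm):
    """epsilon from Beer's law, tau = exp(-epsilon c x); units liter/(mole cm)."""
    return -math.log(tau) / (conc_mol_per_l * depth_cm)

def concentration_from_calibration(tau_unknown, calibration):
    """Interpolate linearly in log10(tau) between calibration points (c, tau)."""
    y = math.log10(tau_unknown)
    points = sorted((c, math.log10(tau)) for c, tau in calibration)
    for (c1, y1), (c2, y2) in zip(points, points[1:]):
        low, high = min(y1, y2), max(y1, y2)
        if low <= y <= high and y1 != y2:
            return c1 + (y - y1) * (c2 - c1) / (y2 - y1)
    raise ValueError("transmittancy lies outside the calibrated range")

# Synthetic calibration: concentrations from 0.1c to c (assumed numbers).
depth = 0.5                    # cm, assumed layer depth
eps_true = 2000.0              # liter/(mole cm), assumed
concs = [1.0e-4 * k for k in range(1, 11)]
calib = [(c, math.exp(-eps_true * c * depth)) for c in concs]

print([round(molar_extinction(tau, c, depth), 1) for c, tau in calib[:3]])
print(concentration_from_calibration(math.exp(-eps_true * 3.5e-4 * depth), calib))
```

In a real experiment the computed ε drifts at the ends of the range, which is the "limited range of concentration" referred to above.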


APPLICATIONS 


The foregoing work can be considered to form 
the preparation for a group of experiments using 
more accurate and elaborate commercial appa- 
ratus. In the laboratory for premedicals and 
biologists, etc. three experiments are done in in- 
tervals of four weeks, in the order: (1) light ab- 
sorption, (2) x-ray absorption, (3) colorimetry. It 
is found that the repeated review of the prin- 
ciples involved is important for their thorough 
understanding. In the biophysics laboratory, 
work on these experiments is considered prepara- 
tory, and they are performed in one, or two con- 
secutive laboratory periods, closely succeeded 
by work with more elaborate apparatus as out- 
lined in the following sections. 







1. Photoelectric Densitometer 


The instrument allows direct readings of percentage
transmittancies (100Pₓ/P₀) and of
optical densities. One checks the accuracy of the
readings with grey glass filters of known transmittancy
and corresponding optical density.
Various optical densities are produced on film
with visible light and x-rays (dental film), by
variation of exposure (intensity × time) and
equal darkroom treatment. The optical densities
plotted against exposure times result in a sensi- 
tometric curve from which the sensitometric con- 
stants of the film material used (contrast, in- 
ertia, etc.) can be read off. 


2. Dubosque Colorimeter 


This instrument uses visual matching of two
fields by varying the depths x₁ and x₂ of two
columns of colored solutions with concentration
c₁ and c₂, respectively, one of which is used as a
standard. The instrument can be used to demonstrate
Beer's law [Eq. (3a)] by keeping constant
at the matching point the relation: εc₁x₁ = εc₂x₂,
where ε is a constant. A calibration curve can
be obtained, and used for determining unknown
concentrations.
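At the matching point the relation between depths and concentrations gives the unknown concentration directly. A minimal sketch, with an assumed standard concentration and assumed depth readings:

```python
def unknown_concentration(c_standard, depth_standard_cm, depth_unknown_cm):
    """Matching condition c1 * x1 = c2 * x2 solved for c2."""
    return c_standard * depth_standard_cm / depth_unknown_cm

# Assumed readings: a 0.010 mole/liter standard matched at 2.0 cm
# against an unknown column 4.0 cm deep.
print(unknown_concentration(0.010, 2.0, 4.0))
```

The unknown column must be twice as deep as the standard to match, so its concentration is half that of the standard.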

A useful adjunct experiment is to observe the 
adsorption of the colored solute on an adsorbent, 
such as filter paper. One prepares a number of 
concentrations of a colored solution (methylene 
blue, fuchsin, etc.) of equal volume, adds to each 
the same amount of adsorbent, decants after a 
certain time, and measures colorimetrically the 
change in concentration. The adsorption iso- 
thermal curve can be obtained by plotting the 
adsorbed amount per unit mass of adsorbent 
against the final equilibrium concentration of the 
solution. A logarithmic representation gives
opportunity for a comparison with Freundlich's
equation A = A₀cᵐ, where A is the adsorbability;
or log₁₀A = log₁₀A₀ + m log₁₀c, from which the
adsorption exponent m can be found.
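The logarithmic representation amounts to a straight-line fit of log A against log c. The sketch below assumes the Freundlich form with constants A₀ and m as written above and uses synthetic data (A₀ = 2.0, m = 0.4, both assumed); the function name is illustrative.

```python
import math

def fit_freundlich(concentrations, adsorbed):
    """Return (A0, m) from a least-squares line through log10(A) vs log10(c)."""
    xs = [math.log10(c) for c in concentrations]
    ys = [math.log10(a) for a in adsorbed]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    m = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
         / sum((x - mean_x) ** 2 for x in xs))
    a0 = 10.0 ** (mean_y - m * mean_x)
    return a0, m

# Synthetic equilibrium concentrations and adsorbed amounts (assumed units).
cs = [0.001, 0.002, 0.005, 0.01, 0.02]
amounts = [2.0 * c ** 0.4 for c in cs]
a0, m = fit_freundlich(cs, amounts)
print("A0 =", round(a0, 4), " m =", round(m, 4))
```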


3. Photoelectric Colorimeter 


These instruments, a number of which are
commercially available, allow transmittancies
to be read from a scale that has to be calibrated.
They are usually equipped with color filters,
and can be used like the Dubosque instrument
in an adsorption experiment.


4. Spectrophotometer 


These more elaborate instruments use a mono- 
chromator to prepare a narrow range of the 
spectrum reaching from the near infrared into 
the ultraviolet (where a special lamp has to be 
used as light source). The absorption spectrum 
of various solutions can be studied and plotted 
on a graph showing percentage transmittancy 
against wavelength. 

In the biophysics laboratory a chromato- 
graphic study, for example, the separation of the 
chlorophylls and carotenes, can precede the study 
of the absorption spectra of the extracts. 


5. X-Ray Absorption Coefficients 


The measurement of the linear x-ray ab- 
sorption coefficient for different metals can be 
done by using the photocell arrangement (Fig. 1) 
described above. The box b with the photocell 
is placed at a convenient distance under the head 
of a dental x-ray tube. The voltmeter is kept at 
a greater distance than in case of the light experiment
to avoid radiation effects on the observer.
The slit SZ of the box is covered with a
fluorescent x-ray screen (Patterson screen)
with the salt facing the photocell. The fluorescent
light produced by the x-rays is measured with
the help of the photocell and vacuum tube voltmeter.
When zero reading has been established,
absorbing thin metal sheets are stacked up on
top of the screen, and the decrease in intensity
observed. Absorption curves for copper, aluminum,
etc., can be plotted according to Eqs. (3)
and (4), and the half-value thickness (which is
particularly used in x-ray work) determined as
explained before. The linear absorption coefficient
for x-rays, usually denoted by μ, is found
from the graphs, and the mass absorption coefficient
μ_m = μ/ρ calculated, where ρ is the density
of the absorber.
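The step from a measured half-value thickness to the mass absorption coefficient can be sketched as follows. The half-value thickness of 0.10 cm is an assumed reading, not a measured result; 2.70 g/cm³ is the handbook density of aluminum.

```python
import math

def linear_coefficient(half_value_thickness_cm):
    """mu = ln(2) / x_half, in 1/cm (Eq. (5) solved for the coefficient)."""
    return math.log(2.0) / half_value_thickness_cm

def mass_absorption_coefficient(mu_per_cm, density_g_per_cm3):
    """mu / rho, in cm^2/g."""
    return mu_per_cm / density_g_per_cm3

mu = linear_coefficient(0.10)          # assumed half-value thickness, cm
print("mu (1/cm):", round(mu, 3))
print("mu/rho (cm^2/g):", round(mass_absorption_coefficient(mu, 2.70), 3))
```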


6. Use of X-Ray Dosimeters


Proceeding as directed in the preceding paragraph,
one measures the x-ray intensity with
commercial dosimeters, either thimble chambers
or direct reading radiation meters. For nonhomogeneous
radiation one can readily observe
that the semilogarithmic graph according to
Eq. (4) does not result in a straight line. One
should note the "point of homogeneity." With
the more sensitive radiation meters it is also
possible to observe the effect of scattering of
x-rays.




7. X-Ray Absorption Echelon Filters 


X-ray absorption coefficient determinations 
with this relative method give opportunity for 
photographic work, but the principle of the 
method is involved and not easily understood 
without having Experiment 5 or 6 to precede it. 


Physics of the Glassy State. IV. Radiation-Sensitive Glasses* 


E. U. CONDON
Corning Glass Works, Corning, New York 


(Received July 14, 1953) 


Effects of radiation in producing color changes, often called solarization, in glass are described, 
in close analogy with modern work on F centers especially in alkali-halide crystals. Photo- 
sensitive glasses in which the action of radiation is to produce centers of nucleation around which 
colloidal or crystalline precipitates can form are also discussed. 


I. LIGHT QUANTA 


FOR most purposes the relation of light to
glass as an optical medium is fully described 
by giving its index of refraction as a function of 
wavelength. If the light is absorbed this is de- 
scribed by giving a complex index of refraction, 
the imaginary part being related to the absorp- 
tion of the light by the glass. Usually the radiant 
energy that is absorbed in an optically absorbing 
medium goes into heat and produces, for sources 
of moderate intensity, only a slight warming. 

But sometimes more profound changes of a 
photochemical kind are produced by absorption 
of light. This is what happens when dyed mate- 
rials fade or when a photographic emulsion is 
affected by light. Such effects are also produced 
in certain kinds of glass, giving rise to a new way 
of studying glass and to a new range of important 
possible applications. This relatively new study 
of the photochemistry of glass is the subject to be 
reviewed in the last lecture. 

Let us start by recalling that, although light 
is propagated as a wave motion, it behaves in a 
corpuscular way when emitted or absorbed by 
matter. Light of frequency ν cycle/sec behaves
as if it were a stream of individual energy units
called light quanta, or photons, containing an
amount of energy hν (ergs), where h is Planck's
constant, h = 6.624 × 10⁻²⁷ erg sec. This quantum
aspect of electromagnetic radiation manifests itself
over a range of many powers of ten in frequency,
from microwaves to the hardest γ rays.
Here we shall be principally concerned with
visible and ultraviolet light.

* The fourth of a series of lectures delivered at the
Fifteenth Annual Colloquium of College Physicists, State
University of Iowa, Iowa City, June 20, 1953.

Commonly the wavelength λ = c/ν is given,
instead of the frequency ν, and for light is expressed
either in microns (10⁻⁴ cm) or angstroms
(10⁻⁸ cm). Visible light ranges approximately
from 0.7 micron or 7000 angstroms (red) to 0.36
micron or 3600 angstroms (violet). If λ is in
microns the size of a quantum is (1.98/λ)·10⁻¹²
erg. It is convenient to introduce two other units
for quanta. One is the electron volt (ev) which is
the energy gained by an electron in falling freely
across a potential drop of one volt,

e/300 = 1.602 × 10⁻¹² erg.

This is the energy of a quantum whose wavelength
is 1.2395 μ or 12 395 A, so the energy of a
light quantum in electron volts is

1.2395/λ or 12 395/λ

according as λ is expressed in μ or in A.

The other unit involves expression of a mole
of quanta in terms of calories/mole. Since
N = 6.02 × 10²³ per mole (Avogadro's number) and
J = 4.185 × 10⁷ erg/calorie, one mole of quanta of
wavelength λ is

28.4/λ or 2.84 × 10⁵/λ kcal/mole

according as λ is in μ or in A.

Radiations in the visible or ultraviolet thus 
correspond to quanta whose molal energy equi- 
valents are comparable with chemical bond dis- 
sociation energies. For example, the quantum of 
the prominent 2537 Å line emitted by the mercury
arc corresponds to 4.9 ev or to 112 kcal/mole. 
Radiations in the infrared correspond to much 
smaller quanta. The fundamental absorption 
band associated with the Si-O bond in glasses 
and crystalline silicates is at about $\lambda = 9\,\mu$, for
which the quantum energy is 0.14 ev or 3.2
kcal/mole. Such small quanta are usually unable 
to induce electronic transitions—they act directly 
to change the vibrational motion of the atoms in 
the absorbing medium. 
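
The unit conversions above can be checked numerically. The following Python sketch is only an illustration: it uses the constants quoted in the text, the speed of light is an assumed value not stated in the lecture, and the function name `quantum_energy` is hypothetical. With these inputs it gives about 4.9 ev and close to the quoted 112 kcal/mole for the 2537 Å mercury line, and about 3.2 kcal/mole for the 9 $\mu$ Si-O band.

```python
# Constants as quoted in the text (cgs units), except where marked.
H = 6.624e-27          # Planck's constant, erg sec
C = 2.998e10           # speed of light, cm/sec (assumed; not quoted in the text)
ERG_PER_EV = 1.602e-12 # one electron volt in erg
N_AVOGADRO = 6.02e23   # quanta per mole
ERG_PER_CAL = 4.185e7  # mechanical equivalent of heat, erg/calorie


def quantum_energy(wavelength_microns):
    """Return (erg, ev, kcal/mole) for a quantum of the given wavelength in microns."""
    wavelength_cm = wavelength_microns * 1.0e-4
    erg = H * C / wavelength_cm
    ev = erg / ERG_PER_EV
    kcal_per_mole = erg * N_AVOGADRO / ERG_PER_CAL / 1000.0
    return erg, ev, kcal_per_mole


if __name__ == "__main__":
    for label, microns in [("Hg 2537 A line", 0.2537), ("Si-O band at 9 microns", 9.0)]:
        erg, ev, kcal = quantum_energy(microns)
        print(f"{label}: {erg:.3e} erg, {ev:.2f} ev, {kcal:.1f} kcal/mole")
```

The three returned values are the same quantum expressed in the three units introduced above, so any one of them can be used to compare a radiation with a bond dissociation energy.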

Ordinary glass is thus transparent to visible 
light because the ionic vibrations occur at such 
low frequencies that absorption by them occurs 
in the infrared, and because the first-excited 
electronic levels in it lie so high (>3 ev) above 
the ground level that absorption by electronic 
levels occurs first with ultraviolet light. Colored 
glasses contain materials having electronic levels 
from 1.8 to 3.4 ev, approximately, above the 
ground level, which are able to absorb visible 
light. 

The intensity of a beam of light ought always
to be expressed in units of energy crossing in
unit time a unit of area normal to the direction
of propagation of the beam. In practice it is more
commonly stated in photometric units, in particular
the foot-candle. At $\lambda = 0.55\,\mu$, the wavelength
giving maximum visual sensation for unit energy,
this corresponds to an energy flux of 17.4 erg/cm$^2$ sec
or $4.87 \times 10^{12}$ quanta/cm$^2$ sec. (One foot-candle
corresponds to $1.08 \times 10^{-3}$ lumen/cm$^2$ and
the maximum luminous efficiency at $\lambda = 0.55\,\mu$ is
621 lumens/watt, which is equivalent to
$1.61 \times 10^{4}$ erg/sec for one lumen.)

TABLE 4-I. Wavelength relative-visibility relation.

$\lambda(\mu)$    $g$

For light of other wavelengths a correspond- 
ingly greater energy is needed to give the visual 
sensation called one foot-candle. The energy flux 
in erg/cm$^2$ sec is equal to $17.4/g$, where $g$ is a
factor known as the relative visibility and is
listed in Table 4-I.

The flux of quanta/cm$^2$ sec equivalent to
illumination of one foot-candle at wavelength $\lambda$
in microns is then

$$n = (\lambda/g)\,4.87 \times 10^{12}.$$


Another comparison of interest is with the 
dosage required for minimum perceptible ery- 
thema, defined as that which produces a slight 
reddening of the skin of an average untanned 
individual observable 12 to 24 hours after 
exposure. The radiation¹ of maximum effectiveness
has $\lambda = 0.297\,\mu$ and for it the dose is $5 \times 10^{5}$
erg/cm$^2$. This corresponds to $7.52 \times 10^{16}$ quanta/cm$^2$.
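
The photometric and erythema figures quoted above follow from simple arithmetic, sketched here in Python. This is an illustrative check rather than part of the lecture: the function names are hypothetical, the factor $10^{7}$ erg/sec per watt is supplied as an assumption, and the quantum size $(1.98/\lambda) \times 10^{-12}$ erg is taken from the text. The results agree with the quoted 17.4 erg/cm$^2$ sec, $4.87 \times 10^{12}$ quanta/cm$^2$ sec and $7.52 \times 10^{16}$ quanta/cm$^2$ to within about one percent.

```python
LUMEN_PER_CM2_PER_FOOT_CANDLE = 1.08e-3  # lumen/cm^2 in one foot-candle (from the text)
MAX_LUMENS_PER_WATT = 621.0              # luminous efficiency at 0.55 micron (from the text)
ERG_PER_SEC_PER_WATT = 1.0e7             # assumed conversion, not quoted in the text
HC_ERG_MICRON = 1.98e-12                 # quantum size is (1.98/lambda)e-12 erg, lambda in microns


def foot_candle_energy_flux():
    """Energy flux in erg/cm^2 sec for one foot-candle at 0.55 micron."""
    erg_per_sec_per_lumen = ERG_PER_SEC_PER_WATT / MAX_LUMENS_PER_WATT
    return LUMEN_PER_CM2_PER_FOOT_CANDLE * erg_per_sec_per_lumen


def quanta_from_energy(energy_erg, wavelength_microns):
    """Number of quanta carrying the given energy at the given wavelength."""
    return energy_erg * wavelength_microns / HC_ERG_MICRON


if __name__ == "__main__":
    flux = foot_candle_energy_flux()
    print(f"energy flux: {flux:.1f} erg/cm^2 sec")
    print(f"quantum flux at 0.55 micron: {quanta_from_energy(flux, 0.55):.2e} per cm^2 sec")
    print(f"erythema dose at 0.297 micron: {quanta_from_energy(5.0e5, 0.297):.2e} quanta/cm^2")
```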

Light is absorbed in homogeneous media by
the exponential law, $I(x) = I(0)e^{-kx}$, where $k$ is
the absorption coefficient in cm$^{-1}$ and $x$ the depth
of penetration in cm. This means that a fraction
$-(dI/I) = k\,dx$ is absorbed in going a distance $dx$.
If the number of absorbing atoms in unit volume
is $n$ (cm$^{-3}$) and each has an effective cross section
for absorption $\sigma$ (cm$^2$), then the absorption
coefficient is

$$k = n\sigma.$$


The seemingly implied proportionality of k with 
n is known as Beer’s law. It is true to the extent 
that the structure of the absorbing centers does 
not change with a change in their concentration.
Often there are large departures from Beer’s law 
if the chemical state of the absorbing atoms 
changes with $n$. Study of the variation of experimental
values of $k/n$ with $n$ is an important tool
in the study of photochemical mechanisms.
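
As an illustration of the exponential law and of the Beer's-law test just described, the following Python sketch computes $k = n\sigma$, the transmitted fraction $e^{-kx}$, and the ratio $k/n$ for a set of measurements. The function names and the numerical values in the usage lines are hypothetical and chosen only to make the arithmetic easy to follow.

```python
import math


def absorption_coefficient(n_per_cm3, sigma_cm2):
    """k = n * sigma, in cm^-1."""
    return n_per_cm3 * sigma_cm2


def transmitted_fraction(k_per_cm, depth_cm):
    """I(x)/I(0) = exp(-k x) for a homogeneous medium."""
    return math.exp(-k_per_cm * depth_cm)


def cross_sections(k_values, n_values):
    """Experimental k/n for each sample; constant values indicate Beer's law holds."""
    return [k / n for k, n in zip(k_values, n_values)]


if __name__ == "__main__":
    # Hypothetical sample: 1e18 absorbing atoms per cm^3, cross section 1e-18 cm^2.
    k = absorption_coefficient(1.0e18, 1.0e-18)
    print("k =", k, "cm^-1")
    print("fraction transmitted through 1 cm:", transmitted_fraction(k, 1.0))
    # Hypothetical measurements at two concentrations.
    print("k/n:", cross_sections([2.0, 4.0], [1.0e18, 2.0e18]))
```

If the values of $k/n$ returned for different concentrations drift systematically, that is the kind of departure from Beer's law which points to a change in the chemical state of the absorbing atoms.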

Suppose now a beam of light of intensity $I$
foot-candles, of wavelength $\lambda$, enters a material
whose absorption coefficient is $k$ for a time
interval of $t$ seconds. Then the total dosage, expressed
as number of quanta absorbed per cm$^3$
in the exposure, is

$$Q = nktI = Itk\,\frac{\lambda}{g(\lambda)}\,4.87 \times 10^{12} \ \ \frac{\text{quanta}}{\text{cm}^3}.$$

This only applies to the layer in which the
intensity is $I$. As $I$ falls off with depth with a
factor $e^{-kx}$, so also will $Q$.

1 Coblentz, Stair, and Hogue, Proc. Natl. Acad. Sci. U.S.
17, 401 (1931).

The unit for measuring x-ray dosage is the
roentgen, defined as the amount of radiation
which produces 1 esu of charge in 1 cm$^3$ of air at
0°C and 760 mm. Letting $e$ = charge on electron
in esu, 1 esu corresponds to formation of $e^{-1}$ ion
pairs per cm$^3$; so the number of ion pairs per
cm$^3$ produced by $r$ roentgens is $r/e$.

If the time of exposure is $t$ sec, $I$ is the intensity
of the x-ray beam in erg/cm$^2$ sec, $A$ is the energy
in erg/ion pair needed to produce an ion pair on
the average, and $\mu_A$ (cm$^{-1}$) is the linear absorption
coefficient of the x-rays in air, then

$$It = rA/(\mu_A e) \ \ \text{erg/cm}^2.$$

If $B$ is the energy per ion pair in another
material, whose linear x-ray absorption coefficient
is $\mu_B$ (cm$^{-1}$), and it is exposed to the same dosage
(the same $It$), then

$$rA/(\mu_A e) = N_B B/\mu_B,$$

where $N_B$ is the number of ion pairs/cm$^3$ formed
in material $B$, so

$$N_B = (A/B)(\mu_B/\mu_A)(r/e),$$

where $N_A = r/e$, as stated in the foregoing, is the
number of ion pairs formed in air for a total dose
of $r$ roentgens.

Since with x-rays the mass absorption coefficient
is more nearly constant from element to
element, it is convenient to write, if $\rho$ is the
density in g/cm$^3$,

$$N_B = C(A/B)\,\rho_B r\,(e\rho_A)^{-1} \ \ \text{ion pairs/cm}^3,$$

where $C = (\mu/\rho)_B \div (\mu/\rho)_A$.

TABLE 4-II. Biological effects of x-radiation.

Roentgens (one dose)    Effect
<25                     none
25-50                   blood changes, no serious injury.
50-100                  some injury, no disability.
100-200                 possible disability.
200-400                 disability certain, death possible.
400                     fatal in approximately half the cases.
600                     fatal in nearly all cases.

If E is the energy of an x-ray quantum, the 
number of ion pairs formed per quantum is 
$(E/B)$ and therefore the total number of quanta/cm$^3$
absorbed in material $B$ is

$$Q_B = BN_B/E = C\rho_B r\,(A/E)(e\rho_A)^{-1} \ \ \text{quanta/cm}^3.$$


For practical applications it is convenient to
insert the experimentally determined² value
$A = 33$ ev or 33/1000 kev and to express $E$ also in
kev so

$$Q_B = \frac{C\rho_B r}{E}\,\frac{33}{e\rho_A}\,10^{-3} = \frac{C\rho_B r}{E}\,(5.35 \times 10^{10}) \ \ \text{quanta/cm}^3.$$


The ion pairs are not uniformly distributed 
throughout the volume but occur in clumps, 
averaging $(E/B)$ ion pairs per clump, each one
formed around the site of absorption of an x-ray 
quantum. 
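
The x-ray dosage relations above can be put into numbers as follows. This Python sketch is an illustration under stated assumptions: the electronic charge is taken from the relation $e/300 = 1.602 \times 10^{-12}$ erg given earlier, the density of air at 0°C and 760 mm is an assumed handbook value not quoted in the lecture, and the function names are hypothetical. With these inputs the numerical coefficient comes out within about one percent of the quoted $5.35 \times 10^{10}$.

```python
E_CHARGE_ESU = 300.0 * 1.602e-12  # electronic charge in esu, from e/300 = 1.602e-12 erg
RHO_AIR = 1.293e-3                # density of air at 0 C, 760 mm, g/cm^3 (assumed value)
A_KEV = 33.0e-3                   # energy per ion pair in air, kev (33 ev, from the text)


def ion_pairs_per_cm3_air(r_roentgens):
    """N_A = r/e: ion pairs formed per cm^3 of air by a dose of r roentgens."""
    return r_roentgens / E_CHARGE_ESU


def quanta_absorbed_per_cm3(r_roentgens, quantum_kev, rho_b, c_ratio=1.0):
    """Q_B = C rho_B r (A/E) / (e rho_A), with A and E both in kev.

    c_ratio is C, the ratio of mass absorption coefficients (material over air).
    """
    return c_ratio * rho_b * r_roentgens * A_KEV / (quantum_kev * E_CHARGE_ESU * RHO_AIR)


if __name__ == "__main__":
    print(f"ion pairs per cm^3 of air per roentgen: {ion_pairs_per_cm3_air(1.0):.3e}")
    # Coefficient of C rho_B r / E, to compare with the 5.35e10 of the text.
    print(f"coefficient: {quanta_absorbed_per_cm3(1.0, 1.0, 1.0):.3e}")
    # Hypothetical case: 100 roentgens of 50-kev quanta in a glass of density 2.5, C = 1.
    print(f"example dose: {quanta_absorbed_per_cm3(100.0, 50.0, 2.5):.3e} quanta/cm^3")
```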

As an orientation on biological effects of 
x-radiation for a single dose over the whole 
human body, Table 4-II is useful.³

The tolerance dose rate adopted by the U. S. 
Committee on X-Rays and Radium Protection 
is 0.3 roentgen/week. On the other hand, highly 
localized doses up to 1000 roentgens on small 
parts of the body may be used in cancer therapy. 


II. CONSEQUENCES OF PRIMARY LIGHT 
ABSORPTION 


In the study of luminescent materials it is 
often the case that the ability of the material 
to absorb the stimulating visible or ultraviolet 
light is due to the presence of an accidental 
impurity, or a deliberately added constituent, 
present in small amount. This active constituent 
is called an activator. For example, pure ZnS 
shows almost no luminescence, but can be 
activated by proper incorporation in its lattice of 

2 W. V. Mayneord, Rept. Progr. in Phys. 14, 396 (1951).
See also "Report of a discussion on radiation chemistry,"
The Faraday Society (1952).

3 Los Alamos Scientific Laboratory, The Effects of Atomic
Weapons (Government Printing Office, Washington, D. C.,
1950), Chaps. 8 and 11.



10-* parts of Au, Ag, Cu, Mn, and some other 
metals. In such cases the important effects are 
associated with the electronic properties of what 
Lenard called active centers; that is, the activator 
atoms and their immediate surroundings. Thus 
the properties of the active centers depend both 
on the activator and on the base material which 
surrounds it. A further complication arises in the 
fact that some properties depend on the simul- 
taneous presence of two activators. Such mater- 
ials are said to be coactivated; this happens in the 
solarization of glass as discussed later. 

The primary act of absorption of visible or 
ultraviolet light raises one of the electrons to a 
higher-energy level (Fig. 14) in a process that is 
generally accompanied by a change in the vibra- 
tional motions of those atoms whose bonding is 
affected by this change in state of the electron. 
The nuclear motions thus associated with elec- 
tron transitions are governed by the Franck- 
Condon principle,⁴ first worked out in 1926 for
diatomic molecules, and since found to be appli- 
cable to polyatomic molecules and to vibrational 
motions in condensed solids. 

For ultraviolet radiation the energy given the 
electron may not be enough to free it from its 
local bonds, or if so will not give it more than one 
or two electron volts in addition to what it takes 
to free it, as in a of Fig. 14. Therefore the distance
it goes before being captured again can hardly 
be more than a few interatomic distances. It may 
be captured into some excited levels at d, which 
may be metastable, or it may lose energy by 
collision in such a way that it is returned essen- 
tially to a ground level, as by process c, without 
emitting radiation. In this case the fluorescence 
is said to have been quenched. With x-rays the 
primary electron is given some tens-of-thousands 
of volts of energy as in a’. Even so it does not go 
far, in a macroscopic sense, before it loses this 
energy and is trapped somewhere. It loses the 
energy by electron impacts which excite and set 
free other electrons indicated by b’ and c’. The 
electrons are trapped so quickly that negligible 
conductivity results but it seems reasonable to 
assume the number knocked loose is of the order 
E/33 per quantum, where E is the quantum 
energy in volts, this being the number of ion 


4 E. U. Condon, Am. J. Phys. 15, 365 (1947) reviews its
development.


Fig. 14. Scheme of electron transitions in glass.


pairs produced by x-ray ionization in air where 
there is no trapping and ion recombination is 
slow enough that the resulting conductivity can 
be measured. 

The excited electron may return at once 
(within about $10^{-8}$ sec) to substantially the same
electronic level as it came from, but again ac- 
companied by associated changes in the vibra- 
tional levels, radiating a suitable light quantum. 
This is what happens in fluorescence. Usually the 
emitted quantum will be smaller than the inci- 
dent quantum leaving some vibrational energy 
in the lattice as heat.⁵ Thus the fluorescent light
is mostly of longer wavelength than the incident 
light. This is known as Stokes’ law. Probably 
there are many unreported cases of fluorescence 
in which the emitted radiation is in the infrared. 
Minor violations of Stokes’ law occur because 
the initial levels are really a Boltzmann distri- 
bution over about 1/40 ev at room temperature 
and the electron may have been initially in a high 
thermal state from which it returned by a 
quantum jump to a low one. In this way the 
fluorescent quantum can exceed the exciting 
quantum by about 1/40 ev. 
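
The figure of about 1/40 ev is the thermal energy $kT$ at room temperature, as the short Python check below illustrates. Boltzmann's constant and the choice of 293°K for room temperature are assumptions supplied here, and the function name is hypothetical.

```python
BOLTZMANN_ERG_PER_DEG = 1.380e-16  # Boltzmann's constant, erg/deg (assumed value)
ERG_PER_EV = 1.602e-12             # one electron volt in erg (from the text)


def thermal_energy_ev(temperature_k):
    """kT expressed in electron volts."""
    return BOLTZMANN_ERG_PER_DEG * temperature_k / ERG_PER_EV


if __name__ == "__main__":
    kt = thermal_energy_ev(293.0)
    print(f"kT at 293 K = {kt:.4f} ev, compared with 1/40 = {1.0 / 40.0:.4f} ev")
```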

The electron may be trapped in a metastable 
excited level, by making a radiative transition to a 
high level from which further transition back to 
the normal state is not allowed as in d. By ther- 
mal agitation it will have a small probability of 
returning, by way of allowed radiative transitions 


5 A. Einstein, Ann. Physik 17, 132 (1905). 







to the normal state, (radiative process e) and so 
will give off light for an extended period of time 
after the incident light is cut off. This is called 
phosphorescence. Since this depends on the 
electrons passing over an activation energy 
barrier, the rate at which the stored energy is 
given off as light will increase rapidly with temp- 
erature. Thus it may happen that at room temp- 
erature the rate is so low that the material does 
not seem to give off light and on warming the 
rate is high enough that it glows brightly for a 
time. This is thermoluminescence. 
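
The rapid increase of the release rate with temperature can be illustrated with a Boltzmann factor. The Python sketch below assumes that the escape rate has the simple form $s\,e^{-E/kT}$; the trap depth of 1 ev and the frequency factor $s = 10^{9}$ sec$^{-1}$ are purely hypothetical values chosen for illustration and are not taken from the lecture.

```python
import math

K_EV_PER_DEG = 8.617e-5  # Boltzmann's constant in ev/deg (assumed value)


def escape_rate(trap_depth_ev, temperature_k, frequency_factor=1.0e9):
    """Assumed form s * exp(-E/kT) for escape over an activation barrier, in sec^-1."""
    return frequency_factor * math.exp(-trap_depth_ev / (K_EV_PER_DEG * temperature_k))


if __name__ == "__main__":
    # Hypothetical trap 1 ev deep, at room temperature and after warming to 200 C.
    cold = escape_rate(1.0, 293.0)
    warm = escape_rate(1.0, 473.0)
    print(f"rate at 293 K: {cold:.2e} per sec")
    print(f"rate at 473 K: {warm:.2e} per sec")
    print(f"ratio: {warm / cold:.2e}")
```

On these assumptions a moderate warming raises the rate by a factor of several million, which is the behavior described above: negligible emission at room temperature and a bright glow on heating.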

If the electron is trapped in a metastable level, 
there is generally another set of higher levels 
above it to which it may now make allowed 
transitions by absorbing light, as in process f. 
Thus a material having a large number of such 
electrons in metastable states has a different 
absorption spectrum than the normal material. 
The normal material may absorb ultraviolet 
light only, but the metastable electrons may be 
able to absorb in the visible. This is the basic 
mechanism by which normally colorless trans- 
parent materials may become colored. The 
metastable trapped electrons (and the surround- 
ings to which they are coupled), which are 
responsible for the new absorption bands, are 
called color centers. 

Absorption of light by the electrons in color 
centers may put the electrons into still a different 
class of metastable states giving rise to power to 
absorb still other new bands, as in process g. 
Thus absorption of light in the color-center band 
may induce a change which brings in an addi- 
tional color change in the material; one which 
could not have been produced by light acting on 
the original material, because before the color 
centers were formed it could not absorb visible 
light. 

Warming the material, in general, will help 
the electrons trapped at color centers to return 
to their normal state with or without noticeable 
thermoluminescence. Thus warming the material 
will bleach out the color center absorption bands, 
returning the material to its original colorless 
condition. Also electrons may be assisted to get 
out of color centers by direct absorption of 
infrared light which may therefore act both to 
increase the brightness of phosphorescence and 
to bleach the material to its original transparency. 




Also the excited electrons in some instances 
may move comparatively freely through the 
material giving rise to photoconductivity. 

The primary act of absorption of ultraviolet 
light by an electron may thus give rise to a wide 
variety of interesting and important effects.⁶

Experimental study of these processes has 
produced an enormous literature of early work, 
carried out before the basic theoretical picture 
was clarified. As the effects observed are ex- 
tremely sensitive to small impurities, and the 
importance of controlling these was at first not 
recognized, it is not surprising that in many 
cases the older literature is almost valueless 
except for the qualitative indications it gives. 
Careful studies on crystals of the alkali-halide 
type with known and controlled added materials 
and also the modern industrial study of crys- 
talline phosphors recently have done much to 
remedy what, but a few years ago, was a most 
unsatisfactory situation. 

Even so the quantitative study of radiation 
sensitivity of glass is still in a very primitive 
state. Nevertheless some interesting and useful 
results have been achieved. Stimulation by 
x-rays and $\gamma$ rays and also bombardment by
high-energy cathode rays or by particles gives 
rise to phenomena broadly similar to those 
produced by ultraviolet light, with differences 
of detail. No attempt will be made here to 
review this field.⁷


III. SOLARIZATION OF GLASS


By solarization is meant a change in the 
absorption spectrum of the glass due to prior 
absorption of light. Usually the light absorption 
causing the change is in the ultraviolet, and, if 
the change also occurs in the ultraviolet, the glass 


6 For reviews of this field: (a) N. F. Mott and R. W.
Gurney, Electronic Processes in Ionic Crystals (Oxford
University Press, London, 1948), Chapters 6 and 7; (b)
F. Seitz, The Modern Theory of Solids (McGraw-Hill Book
Company, Inc., New York, 1940), p. 459; (c) F. Seitz, Revs.
Modern Phys. 18, 384 (1946); (d) G. F. J. Garlick, Repts.
Prog. in Phys. 12, 34 (1949); (e) F. Seitz, Revs. Modern
Phys. 23, 328 (1951); (f) Cornell Symposium of American
Physical Society, Preparation and Characteristics of Solid
Luminescent Materials (John Wiley and Sons, Inc., New
York, 1948).

7 K. H. Sun and N. J. Kreidl, Glass Industry 33, 511,
589, 651 (1952). See also Mayer and Gueron, J. Chem.
Phys. 49, 204 (1952).

8 For a review, see W. A. Weyl, Coloured Glasses (published
by Society of Glass Technology, Sheffield, 1951),
Chap. 3.



may seem to be unaltered so far as visible optical 
properties are concerned. If the new absorption 
band induced by absorption of the ultraviolet is 
in the visible, then the effect of solarization is to 
produce a coloration of the glass. 

The most common instance of this is in the 
purple tinting of old windows or of old bottles 
thrown away on the western deserts, where they 
lie for years exposed to the ultraviolet rays of the 
desert sunlight. The effect was first described by 
Michael Faraday in 1825. In the early days of 
photography (mid 19th century) when the plates 
were mainly affected by the "actinic" rays of the
near ultraviolet and photographers’ studios used 
natural sunlight, this property of window glass 
caused difficulty. The explanation of the mech- 
anism of the purple tinting, which in a modernized 
form is accepted today, goes back to Pelouze in 
1867. 

Nowadays many such solarizable glasses are 
known. In every case the basic action is believed 
to be the absorption of light by an ion in a lower 
state of oxidation, which frees an electron to go 
elsewhere in the network; it perhaps being 
captured in a region of high positive potential 
which might have held another oxygen ion, or 
perhaps being captured by another ion of a differ- 
ent kind which is thereby reduced to a lower state 
of oxidation. Schematically, the light is absorbed 
by ion A in a reduced state which is thereby 
oxidized, the electron then being captured by 
ion B oxidized which is thereby reduced. 

The process is shown schematically in Fig. 24,
where $A_r$ and $A_o$ stand for the relatively reduced
and oxidized states of $A$, respectively, such as
Fe$^{++}$ and Fe$^{+++}$ in the case of iron. When the
electron is captured by $B_o$ in the more oxidized
state there is a binding energy to be given up,
which may happen by fluorescence or by direct
loss to vibrational modes of the network (symbolized
by solid and dotted lines, respectively).


Fig. 24. Solarization mechanism.


The essential things are (a) the presence in the
glass of ions which can be further oxidized by
absorption of the light in question and (b) some
means of capturing the ejected electron other
than having it be captured by some $A_o$ ion which
would thereby be reduced to the original $A_r$ ion,
leaving no net change.


From this simple picture several consequences
follow:


(a) If the base material can capture electrons
readily, then the light can change the state of
oxidation of $A$ without the presence of $B$.

(b) The exact optical properties of $A_o$ depend
on its coordination to surroundings and so may
be different from those obtained from other $A_o$
ions originally present in the melt; i.e., the light
generates $A_o$ ions having the network ties
appropriate to $A_r$ ions, which may be different
from the network ties usually appropriate to
$A_o$ ions.

(c) If the glass contains both $A_r$ and $A_o$ ions
in metastable equilibrium and the total concentration
of $A$ is increased enough, then the freed
electrons may be captured by other $A_o$ ions
present, thus nullifying the over-all effect, or at
least diminishing the number of electrons that
are captured by $B_o$. If $B$ capture is responsible for
fluorescence, this means that increased concentration
of $A$, whose presence is necessary for
fluorescence due to $B$, will result in a diminished
amount of fluorescence.


These principles are illustrated in many 
combinations and, of course, can be complicated 
by the simultaneous presence of several kinds of 
polyvalent ions, $A$, $B$, $C$, ... in the glass.

Three specific examples may be mentioned: 


(1) The purple tint of glass containing Mn and
Fe, already mentioned, is due to the reaction
Mn$^{++}$ $\rightarrow$ Mn$^{+++}$ + (e), (e) + Fe$^{+++}$ $\rightarrow$ Fe$^{++}$. The resulting
increased Mn$^{+++}$ concentration gives the purple
tint. At the same time light acting on the Fe$^{++}$
ions produces more Fe$^{+++}$ which is believed to be
the cause of decrease in the ultraviolet transmission
of such glasses.

(2) Glasses containing Ce and As solarize
readily, developing yellow or even dark brown
coloration at larger cerium concentrations.⁹

In Ce-containing glasses, both Ce$^{3+}$ and Ce$^{4+}$
ions are present. The first act of light absorption
gives Ce$^{3+}$ $\rightarrow$ Ce$^{4+}$ and a free electron. Some of the
electrons are either not completely freed or
return to the Ce$^{3+}$ ion to emit a blue fluorescence
which extends into the ultraviolet to about
3000 Å. If As is also present, electrons can be
captured by the As$^{3+}$ ions resulting in yellow-brown
coloration.

(3) Glasses containing Ce and V show a striking
color change on solarization. The electrons are
freed by light absorption in Ce$^{3+}$, as before. The
initially present V$^{3+}$ ions give the glass a green
color. The freed electrons reduce these to V$^{2+}$,
first producing a grey and finally a purple color
as the reduction becomes essentially complete.


All these solarization effects can be reversed 
by heat treatment, to some extent by gentle 
warming, but usually requiring 400 or 500°C for 
anything like complete restoration of the proper- 
ties of the original glass. Reversal can in some 
cases also be caused by red or infrared radiation 
indicating that the trapped electrons are in such 
cases not as tightly held as in the original ion 
from which they were released. This indicates 
that the actual net effect produced when glass is 
exposed to all the frequencies in natural sunlight 
is due to a balance of different tendencies pro- 
duced by different wavelengths. 


IV. PHOTOSENSITIVE GLASSES 


Usually the color changes produced by solari- 
zation are not very deep, corresponding more to 
a tinting than a coloring of the glass. In recent 
years new principles have been discovered in the 
research laboratories of Corning Glass Works 
largely by Stookey,¹⁰ building on earlier work by
R. H. Dalton and W. H. Armistead. These are 
opening up interesting new research and appli- 
cation possibilities. 

The photosensitive glasses are based on the 
general principle that changes in the state of 


9 F. Eckert and K. Schmidt, Glastech. Ber. 10, 80 (1932).

10 S. D. Stookey, (a) J. Phot. Soc. Am. 14, 399 (1948);
(b) Ind. and Eng. Chem. 41, 856 (1949); (c) J. Am. Ceram.
Soc. 32, 246 (1949); (d) Pages 697-706, Vol. 7 of Alexander's
Colloid Chemistry (D. Van Nostrand Company,
Inc., New York, 1950).



oxidation of certain metal ions contained in the 
glass may make it possible for the active centers 
so formed to act as nuclei for crystal growth. 
These may, on subsequent heat treatment, 
coagulate to form colloidal metal particles. These 
can give colloidal colors of greater density than 
those produced in ordinary solarized glass, and 
permit the making of photographic prints of good 
quality in the glass itself. Because the process 
involves coagulation into colloidal particles, it is 
not reversible on heating in the same sense that 
solarization is reversed by heat, that is, by elec- 
tron transfer. At sufficiently high temperature, 
prolonged heating, however, will dissolve the 
colloidal particles in the glass, destroying the 
picture, and making it possible to use the same 
piece of glass for another picture. 

A small concentration of cerium oxide is used 
in a glass which may contain also Ag, Au, or Cu 
ions. For definiteness consider the case of copper. 
The glass is melted under mildly reducing conditions.
If strongly oxidized, the stable blue Cu$^{++}$
ions are formed which cannot be reduced to the 
metallic state by heat treatment, while, if 
strongly reduced, metallic copper forms in the 
melt giving a brown cloudy glass. 

The most effective radiation for exposure is in
the range 3000-3500 Å and exposures of the order
of 2 milliwatt-minutes/cm$^2$ are used. This
corresponds to an exposure of about $2 \times 10^{17}$
quanta/cm$^2$, roughly of the order of magnitude
of the total number of cerium ions contained in 
an area of 1 cm? and a thickness of a few milli- 
meters. 
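
The conversion from an exposure in milliwatt-minutes/cm$^2$ to a number of quanta can be sketched as below. The Python function is hypothetical and given only as a check of the arithmetic; it uses the quantum size $(1.98/\lambda) \times 10^{-12}$ erg from the first section and the assumed conversion of $10^{7}$ erg per joule. Across the 3000-3500 Å range it gives values close to $2 \times 10^{17}$ quanta/cm$^2$.

```python
HC_ERG_MICRON = 1.98e-12  # quantum size is (1.98/lambda)e-12 erg, lambda in microns
ERG_PER_JOULE = 1.0e7     # assumed conversion, not quoted in the text


def exposure_quanta_per_cm2(milliwatt_minutes_per_cm2, wavelength_angstroms):
    """Quanta/cm^2 delivered by an exposure stated in milliwatt-minutes/cm^2."""
    energy_erg = milliwatt_minutes_per_cm2 * 1.0e-3 * 60.0 * ERG_PER_JOULE
    wavelength_microns = wavelength_angstroms * 1.0e-4
    quantum_erg = HC_ERG_MICRON / wavelength_microns
    return energy_erg / quantum_erg


if __name__ == "__main__":
    for angstroms in (3000.0, 3250.0, 3500.0):
        q = exposure_quanta_per_cm2(2.0, angstroms)
        print(f"{angstroms:.0f} A: {q:.2e} quanta/cm^2")
```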

It is believed that the electrons liberated by 
the oxidation Ce$^{3+}$ $\rightarrow$ Ce$^{4+}$ are captured by the
cuprous ions to reduce them to neutral copper
atoms, Cu$^{+}$ $\rightarrow$ Cu. This constitutes the formation
of the latent image. 

Development consists of holding the glass at a 
temperature between its annealing and softening 
temperatures. The development time decreases 
logarithmically with increase in temperature, but 
not as rapidly as does the viscosity of the same 
glass in this range, indicating a lower activation 
energy for migration of the atoms toward each 
other than is involved in viscous flow. A rough 
calculation from curves in Stookey's paper¹⁰ indicates
an activation energy of 105 kcal/mole for
viscous flow and only 40 kcal/mole for the development
process. This roughly parallels the
comparatively low activation energy found for
ion mobility in electrolytic conduction in glass
as compared with that for viscosity, noted in the 
second lecture. In practice one carries out the 
development at as high a temperature as possible 
without deforming the glass, usually close to 
580°C. In passing it may be noted that the glass 
after exposure but before development shows the 
printed picture very faintly. This is attributed to 
the occurrence of solarization by some other 
mechanism, for this faint image clears up quickly 
when the glass is warmed to less than 400°C and 
does not recur if the glass is cooled from that 
temperature without development. 
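
To see what the two activation energies imply, the following Python sketch compares how much each process speeds up for the same rise in temperature, assuming a simple Arrhenius dependence $e^{-E/RT}$ for both. The gas constant is an assumed value, the function name is hypothetical, and the starting temperature of 540°C is chosen only for illustration; the 580°C endpoint and the energies of 105 and 40 kcal/mole are those quoted above.

```python
import math

R_KCAL_PER_MOLE_DEG = 1.987e-3  # gas constant, kcal/mole deg (assumed value)


def arrhenius_speedup(activation_kcal, t1_celsius, t2_celsius):
    """Ratio rate(t2)/rate(t1) for an assumed Arrhenius process exp(-E/RT)."""
    t1 = t1_celsius + 273.15
    t2 = t2_celsius + 273.15
    return math.exp(activation_kcal / R_KCAL_PER_MOLE_DEG * (1.0 / t1 - 1.0 / t2))


if __name__ == "__main__":
    viscous = arrhenius_speedup(105.0, 540.0, 580.0)
    development = arrhenius_speedup(40.0, 540.0, 580.0)
    print(f"viscous flow speeds up by a factor of {viscous:.1f}")
    print(f"development speeds up by a factor of {development:.1f}")
```

Under these assumptions the fluidity rises several times faster than the development rate over the same interval, which is the sense in which the development time falls less rapidly than the viscosity.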


V. PHOTOSENSITIVE OPAL GLASSES 


Opal glasses are of many kinds, but most of 
them depend on some constituent of the glass 
becoming supersaturated and precipitating out 
in a cloud of minute crystals when the glass is 
cooled. The various phenomena familiar in other 
connections of tendencies toward supersaturation 
and the need for crystallization nuclei to start 
crystal growth are also to be found here. 

Stookey finds that he can take certain opal 
compositions and sensitize them by the same 
means as described for photosensitive glasses. 
They then have the property that the opal phase 
forms, on suitable heat treatment, in the part 
that has been exposed to ultraviolet light, but 
the glass remains clear and transparent in the 
part that has not been exposed. This material is 
now in commercial production for various light- 
diffusing illuminating fixtures and other uses. 

The basic mechanism of formation of the latent 
image is presumed to be the same as before. 
Initial heat treatment to nearly 600°C is used to 




develop the metallic nuclei. It is then necessary 
to drop the temperature to about 520°C to start 
the growth of the crystals of the opalizing phase 
on these metallic nuclei. After this the tempera- 
ture is again raised to 550°C to speed up crystal 
growth in the opal phase. 

Still another ramification of importance in 
applications occurred with the discovery¹¹ of an
additional property of some of these photosensi- 
tive opal glasses. It is that the opalized phase 
dissolves much more rapidly in dilute hydro- 
fluoric acid than does the glass in which the opal 
phase is not developed. This makes possible the 
precise forming of glass articles by photo- 
engraving techniques, including in fact the 
making of glass half-tone engravings of good 
quality for printing. This method of making 
engravings for printing will have distinct 
advantages in some fields. The chemically 
machinable glass is also of importance for other 
applications such as the aperture masks in one 
type of color television tube. 


CONCLUSION 


In these lectures I have tried to indicate to you 
that there is a great deal of scientific interest 
in the problems connected with trying to under- 
stand the structure and properties of glass. 
Because of the limited time it has been necessary 
to deal with only a few of the topics and with 
these only rather superficially. I hope that some 
of you will be stimulated by this inadequacy of 
the discussion to delve more deeply and be 
richly rewarded by learning for yourselves about 
the many fascinating and still unsolved problems 
of the Physics of the Glassy State. 


11 S. D. Stookey, Ind. Eng. Chem. 45, 115 (1953).


Errata: Nuclear Quadrupole Resonance


H. G. DEHMELT
Duke University, Durham, North Carolina
[Am. J. Phys. 22, 154 (1954)]


P. 113: read reference 11a, 11b instead of 12, 13 in text.
P. 117: footnote 24a should read: Compare references b and c to Table I. In the caption of
Fig. 4 read: from reference K to Table I.
P. 118: In Table I: Ga$^{71}$ instead of Ga$^{17}$. The missing reference for the Ga-isotopes is
H. G. Dehmelt, Phys. Rev. 92, 1240 (1953). In the caption to Fig. 5 read: from reference b
to Table I.




On the Theory of the Van de Graaff Electrostatic Generator 


A. W. Simon
Alabama Polytechnic Institute, Auburn, Alabama 


(Received November 18, 1952) 


The detailed quantitative theory of a two-carrier, constant potential, electrostatic generator 
is given. It is shown that if the inductor of such a machine is held at constant potential, the 
collector voltage also approaches a constant value, which is proportional to the potential of the 
inductor, the factor of proportionality depending only on the geometry of the machine. It is 
shown that a Van de Graaff machine can be considered as a generator of the same type except 
that the two separate carriers are replaced by a single continuous belt. Some erroneous conclusions
commonly made with respect to the action of the Van de Graaff machine are discussed.
It is pointed out that a possible advance in the application of belt-type generators is
operation in series to attain higher voltages.


On the basis of a principle of electrostatics
which states that charge resides only on the 
surface of a conductor, it is generally concluded 
that in the case of the Van de Graaff electro- 
static generator the belt will continue to transfer 
charge to the collecting electrode (the insulated 
sphere) until its potential rises to such a value 
that electrical breakdown, either in the form of 
gaseous discharge (corona) or, under more ideal 
conditions, arc-over occurs. For example, in one
of the original papers¹ by Van de Graaff and his
co-workers it is stated: "Since the interior of
the sphere is similar to the interior of a Faraday
icepail, the charge passes readily between the
charged belt and the sphere, irrespective of the
potential of the sphere." And further: "The
voltage which is attainable by this device is
limited only by the corona breakdown at the
surface of the spheres." Also, with respect to
the current, it is stated: "The current output
is equal to the rate at which charge is carried into
the spheres and is therefore equal to the product
of the surface density of the charge on the belts
multiplied by the areal velocity with which the
belts enter the spheres."

Textbooks in physics apparently have blindly 
followed the lead of Van de Graaff and his co- 
workers in repeating these theses. For example, 
one² states: "in the Van de Graaff generator ... a
charged belt discharges completely to the out- 
side of a sphere, thereby raising the sphere to 


1 Van de Graaff, Compton, and Van Atta, Phys. Rev.
43, 152-153 (1933).

2 G. Shortley and D. Williams, Elements of Physics (Prentice-Hall,
Inc., New York, 1953), p. 709.


high potential." Another³ states: "The Van de
Graaff generator makes use of the principle that
if a charged conductor is brought into internal
contact with a second hollow conductor, all of its
charge transfers to the hollow conductor, no
matter how high the potential of the latter may
be. Thus, were it not for insulation difficulties,
the charge, and hence the potential, of a hollow
conductor could be raised to any desired value."

As a matter of fact none of these statements
is correct, nor is the so-called Faraday icepail
principle the proper one to be used for the
explanation of the action of the Van de Graaff
generator. In the first place, the belt itself is not a
conductor, nor is it in direct conducting
communication with the sphere, but charge is
transferred from the belt to the sphere through the
medium of a so-called Townsend or gaseous
electric discharge; this obviously involves the
interposition of a considerable potential drop
between the belt and the sphere. On the basis of
this fact alone it could be concluded that not all
of the charge is transferred from the belt to the
sphere. Furthermore, the Faraday principle
applies rigorously only to the case where the
conductor carrying the charge is completely
enclosed by the one receiving the charge, that is,
the icepail must be provided with a tight-fitting
conducting cover before all the charge is
transferred to it. As the experiment is ordinarily
performed, with a relatively insensitive electroscope
or Braun electrostatic voltmeter connected to the
icepail, the fact that some charge remains on the


3 Sears and Zemansky, University Physics
(Addison-Wesley Press, Cambridge, 1952), p. 438.










carrier when an open-top (that is, not completely
closed) icepail is used is not detected. It is, of
course, impossible to build a Van de Graaff 
generator with a completely closed conducting 
shell, since entrance and exit ports must be 
provided for the belt, if not also for other struc- 
tures, such as insulating supports, etc. However, 
a much more fundamental consideration than 
any of the foregoing is that a continual repetition 
of the Faraday icepail experiment (procedure) 
would not necessarily be expected to yield the 
same result as a single trial. In fact, as will be 
shown presently, under these conditions the 
charge, and therefore the potential, of the pail 
would approach exponentially an equilibrium 
value, and would not continue to increase in- 
definitely as has generally been assumed. 

The correct principle to be applied for the 
explanation of the action of the belt-type gen- 
erator is that, since the charge on the belt is 
moved against the attraction of the oppositely 
charged inductor (the plate of the lower charging 
system) and the repulsion of the similarly 
charged collector (the sphere) work is done, and 
this work appears as increased electrical energy, 
and therefore increased potential, of the charge 
on the belt, with the result that the charge on the 
belt arrives at the sphere with a potential higher 
than that of the sphere, and, consequently, 
shares its charge with the sphere. That is, Volta’s
condensing electroscope principle, rather than 
the Faraday icepail principle, should be used as 
a basis for the explanation of the action of the 
Van de Graaff machine. The latter would also 
function if the belt were carried up entirely 
outside of the sphere, but insulation difficulties 
would naturally increase due to the necessity 
of supporting pulleys, bearings, etc., externally 
to the sphere. Of interest in this connection is 
a frictional electrostatic generator constructed 
by Busch,⁴ in which an endless paper belt passing
over cylindrical rollers is electrified by rubbing
with cat's fur, and the charge is collected by a
serrated piece of sheet metal placed transversely 
to the belt and connected to a metallic sphere 
separately supported. In this machine the small 
contact emf between the rubbing substances 
nevertheless suffices to raise the metallic sphere 


4 Graetz, Handbuch der Elektrizität und des Magnetismus
(Johann Ambrosius Barth, Leipzig, 1918), Vol. 1, p. 28.





THEORY OF ELECTROSTATIC GENERATORS 














to very high potential without the necessity of 
the belt entering the sphere. 

In order to understand the action of electro- 
static generators, the Van de Graaff machine in 
particular, it is instructive to follow the
phenomena of the condensing electroscope
somewhat in detail.⁵

Let it be supposed that in the initial position 
of the plates of the condensing electroscope, that 
is, with the upper plate resting on the lower 
plate but with a thin layer of dielectric inter- 
posed, the two are charged by applying between 
them the emf of a battery, say of the order of 
several volts. The electroscope leaves show no 
divergence, since a voltage of this order of magni- 
tude will not affect them. If the battery is now 
disconnected and the upper plate lifted, work is 
done against the attractive forces of the two 
charges, and this work appears as increased 
electrical energy, and therefore increased poten- 
tial of the system, whence the electroscope leaves 
diverge. 

Quantitatively, if we denote by $W_1$ and $W_2$
the electrical energy of the system before and
after lifting off the upper plate, respectively, by
$V_1$ and $V_2$ the corresponding potentials, and by
$Q$ the charge on the condenser, we have

$$W_1 = QV_1/2 \quad\text{and}\quad W_2 = QV_2/2,$$

the amount of energy $W_2 - W_1$ being supplied in
the act of lifting off the upper plate. Since the
energy of a condenser is given by $CV^2/2$, we
have also

$$W_1 = C_1V_1^2/2 \quad\text{and}\quad W_2 = C_2V_2^2/2,$$

or, if we combine these equations,

$$W_2/W_1 = V_2/V_1 = C_1/C_2.$$

The last relationship can also be deduced directly
from the fact that since the charge on the
condenser remains unchanged, we can put

$$Q = C_1V_1 = C_2V_2,$$

whence

$$V_2/V_1 = C_1/C_2.$$


A beautiful demonstration of these ideas is to 
take a (well-insulated) variable air condenser, 
connect across it a low capacity electrostatic 


5 A. W. Simon, J. Franklin Inst. 204, 617 (1927). 














voltmeter, such as one of the Braun type, charge
the condenser when its capacity is a maximum
to a definite voltage, and then turn the dial toward
minimum capacity; the voltage is observed to
rise in inverse proportion to the capacity. In
practice, with a standard 1000-mmf condenser,
the voltage of two 45-volt B batteries connected
in series is easily transformed to 1500 volts.
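
The arithmetic of this demonstration can be checked with a short sketch. Only the 1000-mmf maximum capacity and the two 45-volt batteries come from the text; the function name and the 60-mmf minimum capacity (condenser plus voltmeter) are illustrative assumptions chosen so that the quoted 1500 volts is reproduced.

```python
def voltage_after_capacity_change(v_initial, c_initial, c_final):
    """Voltage on an isolated condenser after its capacity is changed.

    The charge Q = C*V is conserved, so V_final = V_initial * C_initial / C_final.
    """
    if c_initial <= 0 or c_final <= 0:
        raise ValueError("capacities must be positive")
    return v_initial * c_initial / c_final


# Two 45-volt B batteries in series charge the condenser at maximum capacity.
V1 = 2 * 45.0   # volts (from the text)
C1 = 1000.0     # mmf, maximum capacity (from the text)
C2 = 60.0       # mmf, ASSUMED minimum capacity including the voltmeter

V2 = voltage_after_capacity_change(V1, C1, C2)

# Energy ratio W2/W1 computed from C*V**2/2; it should equal V2/V1 = C1/C2.
energy_ratio = (C2 * V2**2) / (C1 * V1**2)

print(f"V2 = {V2:.0f} volts, W2/W1 = {energy_ratio:.2f}")
```

The extra energy is supplied mechanically by whoever turns the dial, which is the point of the demonstration.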

Returning now to the condensing electroscope:
if, after having been lifted off, the upper plate is
connected to another conductor, say one plate
of a second condenser, charge will be transferred
to the latter, and by a repetition of the process of 
charging the condensing electroscope, lifting 
off the upper plate and connecting it to a second 
condenser, the latter would continue to charge 
until its potential became equal to that of the 
upper plate at the instant it is connected to the 
second condenser. More effective action would 
obviously be obtained if the lower plates of both 
condensers were connected together. 

A simple machine actually in the form of a 
rotating, variable, air condenser, charged when 
its capacity is at a maximum and discharged, 
that is connected to a second condenser to be 
charged, when the capacity of the first is at or 
near a minimum, has been described by the 
author.⁶

In connection with this machine it was shown
that if we denote by $V_n$ the potential of the
second condenser after $n$ turns of the rotor of
the first, by $V_e$ the equilibrium potential, by $C_1$
the capacity of the variable condenser in the
charging position, by $C_2$ the capacity in the
discharge position, by $C_3$ the capacity of the
condenser to be charged, and by $V_0$ the charging
voltage applied to the variable condenser, we

Fig. 1. Schematic diagram of a two-carrier, constant-potential,
electrostatic generator.


6 A. W. Simon, J. Opt. Soc. and Rev. Sci. Instr. 9, 345
(1924).


A. W. SIMON 


have formally

$$V_n = V_e(1 - K^n) = V_e(1 - e^{-\alpha n}),$$

where

$$K = C_3/(C_2 + C_3)$$

and

$$V_e = C_1V_0/C_2.$$


These equations predict that the voltage 
across the second condenser will build up accord- 
ing to an exponential law to an equilibrium value 
equal to the potential across the variable con- 
denser at the instant contact is established with 
the other. 
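
As a rough check on the exponential law, the sketch below repeats the charge-and-share step turn by turn and compares it with the closed form $V_e(1 - K^n)$, taking $K = C_3/(C_2 + C_3)$ and $V_e = C_1V_0/C_2$. It assumes ideal charge sharing between the variable condenser (at capacity $C_2$) and the second condenser (capacity $C_3$) with a common lower plate and no leakage; the function names and all numerical values are illustrative.

```python
def build_up(v0, c1, c2, c3, turns):
    """Potential of the second condenser after each turn, starting from zero."""
    v = 0.0
    history = [v]
    for _ in range(turns):
        q_carried = c1 * v0                    # charge taken on at maximum capacity
        v = (q_carried + c3 * v) / (c2 + c3)   # charge shared at minimum capacity
        history.append(v)
    return history


def closed_form(v0, c1, c2, c3, n):
    """Closed-form potential after n turns: V_e * (1 - K**n)."""
    k = c3 / (c2 + c3)
    v_e = c1 * v0 / c2
    return v_e * (1.0 - k**n)


# Illustrative values only: 90 V applied, capacities in mmf.
history = build_up(v0=90.0, c1=1000.0, c2=60.0, c3=500.0, turns=40)
for n in (1, 5, 20, 40):
    print(n, round(history[n], 2), round(closed_form(90.0, 1000.0, 60.0, 500.0, n), 2))
```

The step-by-step values and the closed form agree, and both approach the potential the variable condenser has at the moment of contact without exceeding it.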

The simple theory given, however, is not 
rigorously correct because it ignores a rather im- 
portant fact that while the upper plate receives 
its charge it is influenced electrostatically also 
by other charges in the field, in particular by the 
charge on the conductor being raised to high 
potential, and this has some unexpected con- 
sequences, as we shall see. 

In order to determine how a Van de Graaff 
machine might be expected to perform, let us 
develop the rigorous quantitative theory of a 
simple electrostatic generator, in particular one 
which might be considered as the prototype of 
this species of electrostatic generator, and sub- 
sequently extend this theory to more complex 
machines, such as exemplified by the Van de 
Graaff. 


FUNDAMENTAL EQUATIONS OF A SIMPLE 
ELECTROSTATIC GENERATOR 


Consider (Fig. 1) a simple nonregenerative 
electrostatic generator, which comprises an in- 
ductor 1, maintained at a constant potential 
$V$ relative to earth by means of a battery, or other
source of constant electromotive force, a collector
2, and two revolving carriers, and which operates
in such a way that when either carrier occupies 
the position 3 directly in front of the inductor 
it is connected momentarily to earth, while in 
position 4 it is connected momentarily to the 
collector. In position 3 then the carrier receives 
a charge, which it shares with the collector 2 in 
position 4. The problem is to find the potential of 
the collector after any integral number of 
half-turns or geometric cycles. 

As regards the physical form of the elements 
of the machine, these might take the form of 














spheres or cylinders with hemispherical ends, or 
both. It is assumed, however, that the carriers 
are identical in shape and that they break contact 
simultaneously with the respective brushes. It is 
assumed also that there is no loss of charge by 
electrical leakage. A two-carrier machine is 
chosen rather than a single-carrier machine be- 
cause the equations actually turn out simpler 
in the former case. 

Returning now to the quantitative theory⁷ of
the machine of Fig. 1, if we denote by $Q_i(n)$ the
charge after $n$ cycles, or half-turns, of the
conductor occupying the position indexed $i$, at
the instant the carriers break contact with the
brushes, we have

$$Q_2(n+1) + Q_4(n+1) = Q_2(n) + Q_3(n), \qquad n = 0, 1, 2, 3, \text{etc.} \tag{1}$$


This equation merely states that the charge 
on the system comprising the collector and the 
carrier in contact with it, at the end of any cycle, 
in particular the $(n+1)$st, is equal to the charge
which was present on the collector alone plus
the charge with which the carrier left the position
in front of the inductor at the end of the previous,
that is the $n$th, cycle. Equation (1) expresses
conservation of charge. 

If next we denote by $V_i(n)$ the potential after
$n$ cycles of the conductor occupying the position
indexed $i$, by $C_{ii}$ the coefficient of capacity of the
conductor occupying the position indexed $i$, by
$C_{ij}$ the coefficient of induction of the conductors
occupying the positions indexed $i$ and $j$, and
substitute in Eq. (1) for each charge its value in
terms of the corresponding electrostatic coefficients
and potentials, we have

$$(C_{12}+C_{14})V + (C_{22}+C_{24}+C_{42}+C_{44})V_2(n+1) = (C_{12}+C_{13})V + (C_{22}+C_{23}+C_{42}+C_{43})V_2(n).$$

The potentials $V_3(n)$ and $V_3(n+1)$ do not appear
in this equation because the corresponding conductor
3 is at ground, that is zero, potential, while the
carrier in position 4, being in contact with the
collector, is at the collector potential $V_2$. The
last equation can be written more compactly if
we make use of a notation developed by Maxwell,
in particular if we set

$$C_{ij,kl} = C_{ik} + C_{il} + C_{jk} + C_{jl}, \tag{2}$$


7 The general method followed here is that given by the 
author in the J. Franklin Inst. 237, 177-196 (1944), to 
which the reader is referred for additional details. 







whence it takes the form 


$$C_{1,24}V + C_{24,24}V_2(n+1) = C_{1,23}V + C_{24,23}V_2(n). \tag{3}$$


SOLUTION OF THE FUNDAMENTAL EQUATION 
OF THE GENERATOR 


Equation (3) can be written still more compactly
if we set therein identically

$$a = C_{24,24}, \qquad b = -C_{24,23}, \qquad c = C_{1,23} - C_{1,24}, \tag{4}$$

whence it takes the form

$$aV_2(n+1) + bV_2(n) = cV, \qquad n = 0, 1, 2, 3, \text{etc.} \tag{5}$$


It can be verified by substitution that the 
general solution of Eq. (5) is of the form 


$$V_2(n) = C_1 + C_2r^n, \tag{6}$$

where $C_1$ and $C_2$ are arbitrary constants, and $r$
is given by

$$r = -b/a. \tag{7}$$


If we evaluate the arbitrary constants, we 
obtain further 


$$C_1 = cV/(a+b) = pV = E, \tag{8}$$

$$C_2 = V_2(0) - pV = V_2(0) - E, \tag{9}$$

where we have set identically

$$p = c/(a+b), \tag{10}$$

and

$$E = pV, \tag{11}$$

and in which $V_2(0)$ represents the potential of
the collector at the start, that is for $n = 0$.


Accordingly, the general solution of Eq. (3) is 


$$V_2(n) = E + [V_2(0) - E]r^n, \qquad n = 0, 1, 2, 3, \text{etc.} \tag{12}$$

Equation (12) gives the potential of the collector
after any integral number $n$ of half turns of the
rotor of the machine and at the instant when
the carriers break contact with the brushes.
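
The solution can be verified numerically. The following sketch iterates the recurrence of Eq. (5) directly and compares the result with the closed form of Eq. (12). The values chosen for $a$, $b$, $c$ and $V$ are hypothetical (they are not taken from any real machine), and the function names are introduced here only for illustration.

```python
def collector_potential(a, b, c, v_inductor, v_start, n):
    """Closed form, Eq. (12): V2(n) = E + [V2(0) - E] * r**n."""
    r = -b / a                       # Eq. (7)
    e = c * v_inductor / (a + b)     # Eqs. (10) and (11): E = p*V
    return e + (v_start - e) * r**n


def iterate_recurrence(a, b, c, v_inductor, v_start, n):
    """Step Eq. (5), a*V2(n+1) + b*V2(n) = c*V, forward n times."""
    v = v_start
    for _ in range(n):
        v = (c * v_inductor - b * v) / a
    return v


# HYPOTHETICAL coefficients (arbitrary capacity units) and inductor potential.
A, B, C, V = 42.0, -40.4, -7.8, 100.0
E = C * V / (A + B)   # equilibrium potential, here about -487.5

for n in (0, 1, 10, 100, 400):
    print(n,
          round(iterate_recurrence(A, B, C, V, 0.0, n), 3),
          round(collector_potential(A, B, C, V, 0.0, n), 3))
```

With these numbers $r$ is about 0.96, so the collector potential creeps toward $E$ by roughly four percent of the remaining difference at each half-turn.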


THE TRANSIENT TERM


If next we substitute in Eq. (7) for $a$ and $b$
their values as given by Eq. (4), and then expand
according to the formula given by Eq. (2), we
obtain

$$r = (C_{22}+C_{23}+C_{42}+C_{43})/(C_{22}+C_{24}+C_{42}+C_{44}),$$

which in turn can be written in the form

$$r = (1+f)/(1+g),$$

where

$$f = (C_{23}+C_{43})/(C_{22}+C_{42})$$

and

$$g = (C_{44}+C_{24})/(C_{22}+C_{42}).$$


Now from the fact that every coefficient of
induction is negative and also that every coefficient
of capacity is greater than the absolute value of any
corresponding coefficient of induction, i.e.,
$C_{22} > |C_{42}|$ and $C_{44} > |C_{24}|$, we
can deduce that $f$ is negative and $g$ is positive,
hence $r$ is less than unity.

The term in $r$ therefore represents a transient,
which vanishes for large values of $n$, from which
it follows that as $n$ increases, $V_2(n)$ approaches
the value $E = pV$, which is constant. Hence we
can state: irrespective of its initial value, the
potential of the collector of the machine approaches
a constant value, which is proportional
to the potential applied to the inductor.

If we refer again to Eq. (12), we note also 
that if the potential of the collector originally 
has the value E, no change at all takes place; 
moreover, if the potential of the collector is origi- 
nally higher than the equilibrium potential E, 
the potential of the collector will decrease until 
it reaches the value E. 

If we rewrite Eq. (12) in the form 


$$E - V_2(n) = [E - V_2(0)]r^n, \tag{12a}$$

it is readily seen that the difference between the
equilibrium potential and the actual collector
potential is reduced by a constant factor $r$ at
each half-turn of the rotor of the machine.

As we shall see below, the voltage E has the 
significance also of the electromotive force of 
the generator, and the collector behaves like a 
condenser charging through a high resistance. In 
order to exhibit these relations more clearly let 
us note that Eq. (12) can also be written in the 
form 


$$V_2(n) = E + [V_2(0) - E]e^{-\alpha n}, \tag{12b}$$

in which

$$\log r = -\alpha. \tag{12c}$$


THE TRANSFORMATION RATIO 


The constant p, which may be termed the 
transformation ratio of the machine, is of especial 
interest. From Eq. (10), in view of Eq. (4), we 
have 


$$p = (C_{1,23} - C_{1,24})/(C_{24,24} - C_{24,23}) = (C_{13} - C_{14})/(C_{24,4} - C_{24,3}), \tag{13}$$

and if we expand this according to Eq. (2), we
obtain

$$p = (C_{13} - C_{14})/(C_{24} + C_{44} - C_{23} - C_{43}). \tag{13a}$$


It is instructive to examine in detail the significance
of the various coefficients appearing in
the expression for $p$. First it should be noted that
the coefficient $C_{22}$ does not appear therein at
all; this means that the transformation ratio is
independent of the capacity of the collector.
Next, with respect to the numerator, the coefficient
$C_{13}$ is a measure of the charge induced by
the inductor on the carrier in the position in front
of the inductor, while $C_{14}$ is a measure of the
charge induced by the inductor on the carrier
in the position where it is connected to the collector.
It will be noted that the two effects oppose
each other, the first tending to increase the
charge carried forward by the carrier to the
collector, the second tending to bind the charge
on the carrier in the position in front of the
collector. In the denominator the sum of the
terms $C_{23}$ and $C_{43}$ represents the effect of the
collector and the carrier in contact with it in
inducing on the other carrier, that is, the one in
the grounded position, a charge of sign opposite
to that induced by the inductor; this effect
becomes greater as the potential of the collector
rises, and for a sufficiently high collector potential
could actually reduce the charge on the
carrier to zero, in which case the action would
be to discharge the collector until it reached a
lower potential. However, this would occur only
if the potential of the collector were initially
higher than the equilibrium potential $E$. On the
other hand, the sum of the two terms $C_{24}$ and
$C_{44}$ is a measure of the effectiveness with which
the collector withdraws charge from the carrier
in the position where the two are in contact.
The more completely the collector surrounds
the carrier, the closer the absolute magnitude
of $C_{24}$ will approach $C_{44}$, that is, the more
completely will charge be withdrawn from the carrier.
In the limit, if the collector surrounds completely
the carrier in position 4, we have
$C_{24} + C_{44} = 0$, as well as $C_{14} = C_{43} = 0$, so that $p$
reduces to

$$p = -C_{13}/C_{23} = \text{const}_1.$$


From this it is seen that even though the 
collector surrounds completely the carrier in the 
position where it gives up its charge, and there- 
fore all the charge is withdrawn from the carrier, 
the potential of the collector does not rise in- 
definitely. This is due to the fact that the col- 
lector induces on the carrier when it is in the 
grounded position a charge of sign opposite to 
that induced by the inductor, and that the mag- 
nitude of this charge increases with the potential 
of the collector until the system comes to elec- 
trical equilibrium. 

It is of interest to examine the case where the
carrier is completely surrounded by the inductor
in position 3 but not by the collector in position
4, that is, $C_{23} = C_{43} = 0$. For these conditions
$p$ reduces to

$$p = (C_{13} - C_{14})/(C_{24} + C_{44}) = \text{const}_2.$$

From the last equation it is seen that com- 
pletely enclosing the carrier in the position where 
it receives its charge alone also does not produce 
an infinite transformation ratio. This is because 
less and less charge is withdrawn from the 
carrier as the potential of the collector rises, 
until eventually the system reaches electrical 
equilibrium. 

This suggests that we examine finally the effect 
of enclosing the carrier completely in both 
positions. In this case we would have simultaneously
$C_{44} + C_{24} = 0$ and $C_{23} = C_{43} = 0$, whence
$p = \infty$. From this it can be concluded that the
transformation ratio of the machine becomes 
infinite only if the carrier is completely enclosed 
in two positions, namely when it receives its 
charge and when it shares its charge with the 
collector. 

The sign of p is negative, as can be seen from 
Eq. (13a), and deduced also from qualitative 
considerations, since, if the inductor is held at 
a positive potential, the carrier carries forward a 
negative charge, thus charging the collector to 
a negative potential. 
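
To make the roles of the coefficients concrete, the sketch below evaluates $r$ and $p$ from a table of electrostatic coefficients using the Maxwell sums of Eq. (2). The table is entirely hypothetical: the numbers were chosen only so that the coefficients of capacity are positive, the coefficients of induction are negative, and each coefficient of capacity exceeds the sum of the magnitudes of the induction coefficients in its row. The helper names are likewise assumptions made for this illustration.

```python
def coefficient(table, i, j):
    """Look up C_ij, using the symmetry C_ij = C_ji."""
    return table[(i, j)] if (i, j) in table else table[(j, i)]


def maxwell_sum(table, rows, cols):
    """Sum of C_ij over all i in rows and j in cols, as in Eq. (2)."""
    return sum(coefficient(table, i, j) for i in rows for j in cols)


# HYPOTHETICAL coefficients (arbitrary units). Conductors: 1 inductor,
# 2 collector, 3 carrier at the inductor, 4 carrier at the collector.
C = {
    (1, 1): 20.0, (2, 2): 50.0, (3, 3): 10.0, (4, 4): 10.0,
    (1, 2): -1.0, (1, 3): -8.0, (1, 4): -0.2,
    (2, 3): -0.5, (2, 4): -9.0, (3, 4): -0.1,
}

a = maxwell_sum(C, (2, 4), (2, 4))                               # Eq. (4)
b = -maxwell_sum(C, (2, 4), (2, 3))
c = maxwell_sum(C, (1,), (2, 3)) - maxwell_sum(C, (1,), (2, 4))

r = -b / a        # Eq. (7)
p = c / (a + b)   # Eq. (10)

# Eq. (13a) written out term by term, for comparison.
p_expanded = (coefficient(C, 1, 3) - coefficient(C, 1, 4)) / (
    coefficient(C, 2, 4) + coefficient(C, 4, 4)
    - coefficient(C, 2, 3) - coefficient(C, 4, 3)
)

print(f"r = {r:.4f}, p = {p:.3f}")
```

For this table $r$ lies between zero and unity and $p$ is negative with magnitude well above one, so the hypothetical machine multiplies the inductor potential several times over while remaining bounded.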










TRANSFER OF CHARGE TO THE COLLECTOR 


If we denote by $\Delta Q_2(n)$ the charge transferred
to the collector at the $n$th sharing, we have
obviously

$$\Delta Q_2(n+1) = Q_2(n+1) - Q_2(n), \qquad n = 0, 1, 2, 3, \text{etc.}, \tag{14}$$

or in terms of the potentials

$$\Delta Q_2(n+1) = C_{24,2}[V_2(n+1) - V_2(n)]. \tag{15}$$

If now we substitute for the potentials their
values as given by Eq. (12), we have

$$\Delta Q_2(n+1) = C_{24,2}[V_2(0) - E](r-1)r^n, \tag{16}$$

which in turn can be written in the form

$$\Delta Q_2(n+1) = \Delta Q_2(1)r^n, \tag{17}$$

where we have put

$$\Delta Q_2(1) = C_{24,2}(r-1)[V_2(0) - E]. \tag{18}$$

The quantity $\Delta Q_2(1)$ obviously represents the
quantity of charge transferred to the collector,
or the increment of charge of the collector, at the
first contact.

From Eq. (18) it is readily seen that this
quantity will be a maximum if the potential of
the collector is initially zero and will be actually
zero if the potential of the collector is originally
equal to the equilibrium potential $E$.

The maximum value obviously is

$$(\Delta Q_2)_{\max} = -C_{24,2}(r-1)pV, \tag{19}$$

or, if we substitute for $r$ and $p$ their values in
terms of the corresponding coefficients:

$$(\Delta Q_2)_{\max} = (C_{13} - C_{14})C_{24,2}V/(C_{24,2} + C_{24,4}). \tag{20}$$

The quantity of charge transferred to the
collector is obviously reduced by the factor $r$ at
each contact, as seen from Eq. (17). In exponential
form Eq. (17) becomes

$$\Delta Q_2(n+1) = \Delta Q_2(1)e^{-\alpha n}, \tag{17a}$$

where $\alpha$ is defined as before.
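
A brief numerical sketch of Eqs. (17) and (18) follows. It lists the charge delivered at successive contacts and checks that the increments form a geometric series whose sum is $C_{24,2}[E - V_2(0)]$, the total charge the collector must gain to reach equilibrium. The numerical values and the function name are hypothetical and serve only as an illustration.

```python
def charge_increments(c_24_2, r, e, v_start, contacts):
    """Charge transferred at each of the first `contacts` sharings."""
    first = c_24_2 * (r - 1.0) * (v_start - e)        # Eq. (18)
    return [first * r**n for n in range(contacts)]    # Eq. (17)


# HYPOTHETICAL values: C_24,2 = 41 (arbitrary units), r = 0.9, E = -5.
C_24_2, R_FACTOR, E, V_START = 41.0, 0.9, -5.0, 0.0

increments = charge_increments(C_24_2, R_FACTOR, E, V_START, 2000)
total = sum(increments)
expected_total = C_24_2 * (E - V_START)

print("first three increments:", [round(dq, 3) for dq in increments[:3]])
print("sum of increments:", round(total, 3), " expected:", expected_total)
```

Each increment is smaller than the one before by the factor $r$, which is why the current to the collector falls off as its potential rises.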


ENERGY RELATIONS 


The question might be asked exactly how does 
the machine of Fig. 1 produce a potential greater 
than that applied to the inductor, and so behave, 
in a sense, as an electrostatic voltage multiplier. 

























































































The answer is that work is done in moving the
charged carriers against both the attraction of
the oppositely charged inductor and the repulsion
of the similarly charged collector; this work
appears as increased electrical energy and therefore
increased electrical potential of the conductors
of the system, in particular of the collector.
This principle is the basis of many electrostatic
devices and phenomena, for example, the condensing
electroscope, the electrophorus, the frictional
static machine, the generation of thunderstorm
electricity, and, finally, also the Van de Graaff
machine.

Quantitatively the energy relations can be 
followed, in particular, for example, the loss of 
energy on the sharing of charges between carrier 
and collector can be calculated, on the basis of 
the fundamental equation 


$$W = \tfrac{1}{2}\sum_i Q_iV_i,$$


in which W represents the energy of the system 
and the summation is extended over all the 
conductors of the system. 


TRANSFER OF CHARGE WITH LOAD 
CONNECTED TO COLLECTOR 


So far conservation of charge has been assumed. 
Let us consider now the case when by some 
means a quantity of charge AQ is withdrawn 
from the collector during the time of one-half 
turn of the rotor or one cycle of the machine. The 
potential of the collector will then come to equi- 
librium when the charge transferred to the 
collector at each contact is equal to the charge 


Fig. 2. Schematic diagram of a belt-type electrostatic
generator, which is fundamentally the same as that
shown in Fig. 1, except that the two carriers are replaced
by a belt. In practice the battery is replaced by a high
potential dc generator.







withdrawn from the collector during the cycle.
The corresponding potential $V_2$ can be found
from Eq. (18) by setting therein $\Delta Q_2(1) = \Delta Q$
and $V_2(0) = V_2$, whence we obtain

$$E - V_2 = \Delta Q/[C_{24,2}(1-r)],$$

which, on substituting for $r$ its value, becomes

$$E - V_2 = \Delta Q/[C_{24,2}(1 - C_{24,23}/C_{24,24})].$$


Under these conditions the magnitude of the 
collector potential will remain at a value less 
than the magnitude of the equilibrium potential, 
and the rate of transfer of charge to the collector 
remains constant. 
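
The loaded equilibrium can be illustrated by rearranging Eq. (18) with $\Delta Q_2(1) = \Delta Q$ and $V_2(0) = V_2$, which gives $V_2 = E - \Delta Q/[C_{24,2}(1-r)]$. In the sketch below the function name and all numbers are hypothetical; the charge withdrawn per cycle is taken negative because, with a positive inductor, the collector is charged negatively.

```python
def potential_under_load(e, dq, c_24_2, r):
    """Steady collector potential V2 when a charge dq is withdrawn per cycle.

    Follows from Eq. (18) with dQ2(1) = dq and V2(0) = V2:
        dq = c_24_2 * (r - 1) * (V2 - e)
    """
    if not 0.0 < r < 1.0:
        raise ValueError("r must lie between 0 and 1")
    return e - dq / (c_24_2 * (1.0 - r))


# HYPOTHETICAL values (arbitrary units).
E, C_24_2, R_FACTOR = -5.0, 41.0, 0.9
DQ = -4.1   # charge removed from the collector in each cycle

V2 = potential_under_load(E, DQ, C_24_2, R_FACTOR)

# Charge delivered per contact at this potential, from Eq. (18).
delivered = C_24_2 * (R_FACTOR - 1.0) * (V2 - E)

print(f"V2 = {V2:.3f}, delivered per contact = {delivered:.3f}")
```

The collector settles where the charge delivered at each contact equals the charge drawn off, at a potential smaller in magnitude than $E$.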


EXTENSION OF THE THEORY OF THE 
BELT-TYPE GENERATOR 


It is readily seen that the fundamental action 
of the machine of Fig. 1 would not be altered 
if the number of carriers were increased from 
two to four, or, in general if the machine were 
provided with any even number of symmetrically 
placed, identical carriers, except that the charge 
would be delivered in more frequent pulses. 
Furthermore, a belt-type generator, such as the 
Van de Graaff machine, could be considered as 
comprising the limiting case of an infinite number 
of carriers of infinitesimal width: the current 
delivered to the collector now becoming con- 
tinuous. The close correspondence between the 
machine of Fig. 1 and a belt-type generator is 
shown by Fig. 2, in which homologous elements 
have been similarly indexed: the lower electrode
1 functioning as the inductor, the sphere 2 as 
the collector, that portion 3 of the belt in the 
vicinity of the grounded comb as the carrier 
receiving charge, and, finally, that portion 4 
of the belt in the vicinity of the upper comb as 
the carrier sharing charge with the collector. 

It should be pointed out, however, that in the 
theory of the elemental machine we have as- 
sumed that the carrier made direct contact 
with the brushes, while in a Van de Graaff 
machine the brushes are replaced by a point dis- 
charge, which introduces an additional voltage 
drop into the circuit. The effective inductor 
voltage is then not equal to the voltage applied 
between the inductor and the point or discharge 
electrode, but that actually existing between the 







inductor and that portion of the belt immediately 
in front of it. It is of course between the inductor 
and the charge on the belt that the condensing 
electroscope or generating action takes place. It 
is also seen in this connection that the Faraday 
icepail principle is misleading in that it focuses 
attention on the action between the belt and 
the sphere, whereas the primary generating 
action is between the belt and the inductor; in
fact it is in the region of the inductor that the
major portion of the increase in potential occurs. 

Extending now the theory of the elemental 
generator to the Van de Graaff machine, we can 
infer that the typical behavior of a belt-type, 
nonregenerative electrostatic generator (one in 
which the inductor is maintained at constant 
potential) would be that exhibited in Fig. 3. If 
the collector is originally at ground potential, 


the collector potential builds up exponentially
to its equilibrium value $E$, which is a constant
multiple of the exciting or inductor potential $V$
(portion $abc$ of the voltage curve of Fig. 3). The
current flowing to the collector falls off, also
exponentially, as the collector potential rises (initial
portion of current curve of Fig. 3). If by any
means, say by partial discharge, the collector
potential is suddenly reduced to a value $V_b$
below the equilibrium value $E$, it recovers as
shown in the portion $b'c'$ of the voltage curve,
the portion $b'c'$ being identical with the portion
$bc$ of the original. Similarly the collector current
now decreases to zero from an initial value $i_b$
corresponding to the potential $V_b$ of the voltage
curve. The equivalent circuit of the generator
under these conditions is that shown in Fig. 4a,
where $E = pV$ is the electromotive force (given
by the product of the transformation ratio $p$ and
the inductor potential $V$), $R$ represents the
internal resistance of the generator, and $C$ is the
effective capacity of the collector.


Fig. 3. Voltage and current curves of the machine of Fig. 2.

Fig. 4. Equivalent circuits of the machine of Fig. 2.

It is obvious that if at the instant the potential
of the collector has been depressed to a value $V_b$,
corresponding to a current $i_b$, a load drawing a
constant current $i_b$ is connected to the collector,
the latter will continue to supply the current
$i_b$ to the load at the voltage $V_b$. For a purely
resistive load this would correspond to a resistance
$R_b$ such that $R_bi_b = V_b$. Accordingly, under
these conditions the generator behaves as
though the load resistor had been connected in
parallel with a capacitance of effective value $C$.
The equivalent circuit of the generator operating
into a resistive load is shown in Fig. 4b.


Fig. 5. Schematic diagram of series-connected, belt-type
generators. In practice the batteries are replaced by
high potential dc generators.

On the basis of the preceding considerations 
we can conclude: If all electrical breakdown and 
leakage is prevented, and if the inductor elec- 
trode of a belt-type generator is maintained at a 
constant potential, the collector will also attain 
a constant potential, the magnitude of which will 
be proportional to the inductor potential. The 
factor of proportionality will depend only on the 
geometry of the machine (actually on the ratio of 
certain capacities) and not on the medium in 
which it is immersed. Furthermore, the break- 
down voltage will set a limit to the potential 
attainable only if the equilibrium collector 
voltage corresponding to the product of the 
applied inductor voltage and the transformation 
ratio of the machine lies above the breakdown 


voltage of the sphere, and apparently for
machines operated in air at atmospheric pressure
this is usually the case. However, by operating a
Van de Graaff machine under sufficiently high
pressure it should be possible to verify that the
terminal voltage, even under the conditions of
no load or leakage loss, builds up according to
an exponential law, rather than linearly, as has
been assumed. Experiments along this line
would be of interest.


SERIES OPERATION OF BELT-TYPE 
GENERATORS 


Since the machine of Fig. 2, contrary to 
common belief, cannot be used to achieve any 
desired potential, the question might be asked 
as to how this restriction can be overcome. The 
answer is that theoretically any desired potential 
can be attained provided a sufficient number of 
machines of the type of Fig. 2 are connected in 
series, for example as shown in Fig. 5. This 
arrangement also has the advantage that electric 
stresses are reduced. 


Motion Pictures for Classroom Use 


The American Journal of Physics regrets that the infor- 
mation and prices announced for the New AAPT Physics 
Films in the March issue [22, 155 (1954)] were out of 
date. The correct price of each of the New AAPT Physics 
Films is $50.00, or $45.00 each when three or more films 
are ordered at one time. Films in the Elementary Nuclear 
Series are $40.00 each, or $75.00 for the set of two. Orders 
for the films of either series may be addressed to the 
McGraw-Hill Book Company, Text-Film Department, 
330 West 42nd Street, New York 36, New York, or in 
Canada to 253 Spadina Road, Toronto 4, Ontario, Canada. 

Titles of films in the series previously announced are: 
Basic Physics Series: Uniform Circular Motion, Gasoline 
Engine (Otto Cycle), Diesel Engine (Ideal Diesel Cycle), 
Carnot Cycle (Kelvin Temperature Scale), Measurement 
of the Speed of Light, and Doppler Effect; Elementary 
Nuclear Series: U-238 Radioactive Series and The Nuclear 
Reactor. 

Four new films are now available: 


Simple Harmonic Motion, 10 minutes. Approach is 
dynamic. A mass is free to move horizontally under the 
action of a spring force. Hooke’s law is first introduced. 
The kinematics associated with the “circle diagram” is
then illustrated, concluding with a brief consideration of 
the essential geometry. The relationship between the sim- 
ple harmonic acceleration and Hooke’s law is emphasized. 


Progressive Waves (Transverse and Longitudinal), nine 
minutes. To introduce the subject of transverse progressive 
waves, a pulse is sent down a stretched rope. Its velocity 
of travel is shown to be dependent upon the elastic and the 
inertial factors. Steady wave-train propagation along a 
stretched string is then illustrated. Wavelength and fre- 
quency are identified and related. A similar treatment 
follows for a longitudinal wave disturbance moving down 
a row of beads connected by light springs. 

Stationary Transverse Waves, nine minutes. The nature 
of stationary waves is approached from the point of view 
of the kinematics. Pulse reflection at the fixed end of a 
stretched string is studied, including the matter of phase 
change. The conclusions reached are applied to the reflec- 
tion of a steady wave-train at a string boundary. The 
resulting stationary waves are shown to be compounded 
of two oppositely traveling wave-trains. Nodes and anti- 
nodes are identified and their positions related to the 
wavelength. The film concludes with a brief consideration 
of the resonance patterns for a stretched string of limited 
length. 

Stationary Longitudinal Waves, nine minutes. The 
treatment in this film follows very closely that used in the 
film on Stationary Transverse Waves. A string of beads 
connected by light springs constitutes the medium. Both 
a pulse and a steady train of waves are shown. The situa- 
tion at a fixed end and at a free end is discussed. The film 
concludes with a brief consideration of pipe resonance. 





Elementary Derivation of Equations for Wave Speeds 


AUSTIN J. O'LEARY
The City College of New York, New York 10, New York 


(Received June 19, 1953) 


The equations for the speed of waves in the following cases are derived in an elementary way: 
transverse waves along a stretched wire and through an elastic solid; longitudinal waves through 
a fluid in a uniform tube, through a uniform rod, and along a stretched spring; long gravity 
waves through a shallow canal. Several errors in previously published derivations are pointed 
out.


THE following are new derivations of the
equations for the speeds of several kinds
of waves. I think they are not too hard for be- 
ginning students, are fairly rigorous, and make 
clear what happens to the transmitting medium 
during passage of the waves. It is to be under- 
stood in all cases that the waves are traveling 
without interference from reflected waves. A 
comparison with previously published deriva- 
tions is given at the end of the paper. 


TRANSVERSE WAVES ALONG A STRETCHED WIRE 


Think of a train of transverse waves traveling 
with speed v along a uniform stretched wire with 
an amplitude small enough in comparison with 
the wavelength that: (i) the tension can be taken 
to be the same at all points of the wire; (ii) the 
instantaneous circle of curvature of the wire at 
the crest of a wave can be taken to coincide with 
the wire for a small but at least finite distance on 
either side of the crest. For convenience in dis- 
cussion, suppose the wire is horizontal and that 
the waves are traveling to the right in a vertical 
plane. Figure 1 shows a short segment PP’ of the 
wire at an instant when the crest of a wave is at 
the point P’. The segment is taken short enough 
that: (i) it can be treated as an arc of the circle of 
curvature at P’; (ii) the tangents at P and P’ 
make a small enough angle $\Delta\theta$ with each other so
that the angle itself, its sine, and its tangent can 
all be taken equal to each other. The point P’ is 
momentarily at rest and the other points in the 
segment are moving downward in the wake of the 
crest. Letting $L$ denote the length of the segment,
it can be seen from Fig. 1 that the point P has
moved down from rest a distance $\tfrac{1}{2}L\,\Delta\theta$ in the
interval of time L/v taken for the crest to move 
from P to P’ and that the center of mass C of the 


segment has moved down from rest a quarter as 
far in half the time. Thus, the acceleration is the 
same for each point of the segment and is mo- 
mentarily constant. Computing its value from 
the information for either point P or point C by 
dividing twice the distance through which the 
point has moved from rest by the square of the 
time, the acceleration is found to be $v^2\,\Delta\theta/L$. In
the neighborhood of a crest, $\Delta\theta$ is a maximum and
therefore the acceleration is a maximum. To 
summarize: the acceleration of any tiny segment 
of the wire is greatest and momentarily constant 
when its displacement is a maximum; in so far as 
the acceleration is concerned, the segment can be 
treated as a particle. 

Letting F denote the tension in the wire, Fig. 2 
shows the forces on the segment PP’ at the 
following three instants during passage of a wave 
crest over it: (a) when the crest is at P; (b) when 
the crest is at C; (c) when the crest is at P’. It 
can be seen from Fig. 2 that the resultant force 
on the segment is momentarily constant and that 
its magnitude is $F\,\Delta\theta$. In general, the resultant
force on any tiny segment of the wire is greatest
and momentarily constant when its displacement
is a maximum.

Letting $\sigma$ denote the mass per unit length of
the wire and applying Newton’s second law, we have

$$F\,\Delta\theta = \sigma L \times v^2\,\Delta\theta/L,$$

$$v = \sqrt{F/\sigma}.$$

Fig. 1. Diagram of a very short segment PP’ of a
stretched wire at the instant the crest of a transverse wave
is at the point P’.

Fig. 2. Diagrams showing the forces on a very short
segment PP’ of a stretched wire at the following instants
while the crest of a transverse wave is passing over the
segment: (a) at the instant the crest is at the point P; (b)
at the instant the crest is at the center of mass C of the
segment; (c) at the instant the crest is at the point P’.



Alternatively, this equation can be obtained by 
dealing in a similar way with a segment of the 
wire at the very front of a wave train. Figure 1 
can be used without change in the case where the 
leading pulse of the wave train travels along the 
underside of the wire to show a segment PP’ of 
the wire at the instant the front of the wave train 
first reaches P’. The circle of curvature at P’ then 
refers only to the curved portion of the wire 
immediately to the left of P’ and not to the un- 
disturbed part of the wire to the right of P’. 
Figure 2(c) is the force diagram. The expression 
for the acceleration and the equation for 
Newton’s second law are the same as before. 
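As a numerical illustration of the argument for the stretched wire, the following Python sketch compares the acceleration expression $v^2\,\Delta\theta/L$ with the acceleration computed directly from the geometry of a circular arc, and then checks Newton's second law for the segment. The tension, mass per unit length, radius of curvature, and segment length are assumed values chosen only for illustration; none of them comes from the paper.

```python
import math


def wire_wave_speed(tension, mass_per_length):
    """Speed of small transverse waves on a stretched wire, v = sqrt(F / sigma)."""
    return math.sqrt(tension / mass_per_length)


# Assumed illustrative values (not from the paper).
F = 100.0       # tension in newtons
sigma = 0.01    # mass per unit length in kg/m
v = wire_wave_speed(F, sigma)   # about 100 m/s for these values

# Short segment PP' treated as an arc of the circle of curvature at the crest.
R = 2.0                 # assumed radius of curvature at the crest, in metres
L = 1.0e-3              # assumed segment length, in metres
dtheta = L / R          # angle between the tangents at P and P'

# Distance the point P has fallen below crest level (exact arc geometry),
# and the time L / v taken for the crest to travel from P to P'.
drop = R * (1.0 - math.cos(dtheta))
t = L / v

# Constant acceleration from rest: twice the distance over the time squared.
accel_from_geometry = 2.0 * drop / t**2
accel_from_text = v**2 * dtheta / L

# Newton's second law for the segment: F * dtheta = (sigma * L) * acceleration.
force = F * dtheta
mass_times_accel = sigma * L * accel_from_text

print(v, accel_from_geometry, accel_from_text, force, mass_times_accel)
```

For a segment this short the two accelerations agree closely, which is the content of the small-angle assumptions (i) and (ii) made for the segment.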


TRANSVERSE WAVES THROUGH AN 
ELASTIC SOLID 


Think of transverse waves traveling hori- 
zontally to the right through an elastic medium 
under constant pressure and of density $\rho$. Each
cross-sectional area of the medium at right angles 
to the direction of propagation moves as a unit, 
so the kinematics of the motion is exactly the 
same as in the preceding case of the stretched 
wire. To deal with the forces concerned, suppose 
the medium is in the form of a rod or bar of 
uniform cross-sectional area A. Figure 3(a) is a 
force diagram showing the forces on a segment 




PP’ of the rod at the instant the crest of a wave 
is at the center of mass C of the segment. The 
resultant force on the segment is $2F$, where $F$
denotes the force exerted on each end of it by the
rest of the rod. Letting $S$ denote the shear modulus
of the material, Fig. 3(b) illustrates the
definition of $S$; the shearing stress is $F/A$, and
the shearing strain is $\tfrac{1}{2}\Delta\theta$, where $\Delta\theta$ has the same
meaning as in Fig. 2(a); by definition,

$$S = \frac{F/A}{\tfrac{1}{2}\Delta\theta},$$

$$2F = SA\,\Delta\theta.$$

Thus, the resultant force on the segment can be
expressed in the form $SA\,\Delta\theta$. Letting $L$ denote
the length of the segment, the mass of it is $\rho LA$
and the acceleration of each point of it at the
instant under consideration is $v^2\,\Delta\theta/L$ as in the
preceding case. Applying Newton’s second law,
we have

$$SA\,\Delta\theta = \rho LA \times v^2\,\Delta\theta/L,$$

$$v = \sqrt{S/\rho}.$$
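The force balance for the rod can be checked numerically in the same spirit. The Python sketch below assumes round handbook-style values for steel (shear modulus near 8 × 10^10 Pa, density near 7.8 × 10^3 kg/m³) and an arbitrary small segment; these numbers are illustrative assumptions, not values given in the paper.

```python
import math


def shear_wave_speed(shear_modulus, density):
    """Speed of transverse waves in an elastic solid, v = sqrt(S / rho)."""
    return math.sqrt(shear_modulus / density)


# Assumed illustrative values for steel (not from the paper).
S = 8.0e10     # shear modulus in Pa
rho = 7.8e3    # density in kg/m^3
v = shear_wave_speed(S, rho)

# Assumed segment: cross-sectional area A, length L, small angle dtheta.
A = 1.0e-4       # m^2
L = 1.0e-3       # m
dtheta = 1.0e-4  # rad

# Force on each end from the definition S = (F / A) / (dtheta / 2).
F_end = S * A * 0.5 * dtheta
resultant = 2.0 * F_end

# Mass of the segment times the acceleration v^2 * dtheta / L.
mass = rho * L * A
accel = v**2 * dtheta / L

print(v, resultant, mass * accel)
```

With these assumed values the speed comes out at a little over 3 km/s, and the resultant force equals mass times acceleration, as the derivation requires.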


LONGITUDINAL WAVES THROUGH A FLUID 
IN A UNIFORM TUBE 


The problem is to find the speed $v$ of longitudinal
waves through a fluid of density $\rho$ filling a
tube of uniform cross-sectional area $A$. For
convenience in picturing what happens, we shall 
deal with a single longitudinal pulse and shall 
take the tube to be horizontal and the pulse to be 


Fig. 3. (a) Diagram showing the forces on a very short
segment PP’ of a rod at the instant the crest of a transverse 
wave is at the center of mass C of the segment. (b) Dia- 
gram to illustrate the definition of the shear modulus of the 
material composing the rod. 







traveling toward the right. A pulse of rarefaction 
affects the fluid differently from a pulse of con- 
densation, so we shall treat them separately. 


(i) Pulse of Rarefaction 


Let us first get a picture of what happens to 
any particular cross-sectional area of the fluid 
as a single pulse of rarefaction passes through it. 
As soon as the front of the pulse reaches it, it 
begins to be accelerated toward the left as a result 
of a pressure gradient across it, the pressure being 
less to the left of it than to the right of it. It 
moves toward the left to meet the “trough” of
the pulse. By the time it meets the trough, its 
acceleration has decreased to zero and its velocity 
has increased to a maximum. Then, as the trough 
moves beyond it, its speed decreases and it ends 
up at rest a short distance to the left of its initial 
position just as the tail end of the pulse leaves it. 

To get an equation for the wave motion, we 
have to deal with a longitudinal section of the 
fluid. We shall describe what happens to an 
almost infinitesimally short section of the fluid; 
the cross-sectional area bounding the section on 
the left-hand side we shall call face f and the one 
bounding it on the right-hand side face f’. Let 
$p$ denote the pressure in the fluid outside the
rarefaction and $p+\Delta p$ the pressure in the trough,
$\Delta p$ being negative. Letting $V$ denote the volume
of section ff’ at pressure $p$ and $V+\Delta V$ the
volume at pressure $p+\Delta p$, its length is $V/A$ at
pressure $p$ and $(V+\Delta V)/A$ at pressure $p+\Delta p$.

Figure 4(a) shows the undisturbed position of 
section ff’ with the pulse of rarefaction approach- 
ing it; since ff’ has to be drawn with appreciable 
length for the purpose of illustration, the trough 
has to be shown at least as long, but actually the 
distance over which the pressure is a minimum 
can be almost infinitesimal because the length of 
ff’ is taken to be almost infinitesimal. The time 
taken for the pulse to travel from the initial 
position of face f to that of face f’ is the distance 
V/A between them divided by the speed v of the 
pulse. In other words, the interval from the 
instant face f begins to move until face f’ begins 
to move is $V/(Av)$. Likewise, $V/(Av)$ is the interval
from the time the trough and face f meet each 
other until the trough and face f’ meet each other. 
The two faces travel equal distances to the left 







Fig. 4. (a) Diagram showing the undisturbed position of
a very short cross section ff’ of fluid in a uniform tube as a 
pulse of rarefaction approaches it. (b) Diagram of section 
ff' and the pulse at the instant the trough meets face f. (c) 
Diagram of section ff’ and the pulse at the instant the 
trough meets face f’. 


in meeting the trough. We shall use s to denote 
this distance. 

Figure 4(b) represents section ff’ and the 
pulse at the instant the trough and face f meet 
each other. By this time, the acceleration of face 
f has become zero and its speed has increased to a 
maximum value which we shall denote by v’. 

Figure 4(c) represents section ff’ and the pulse 
at the instant the trough and face f’ meet each 
other. By this time, the speed of face f’ has in- 
creased to the maximum value v’. It can be seen 
that in the interval of time $V/(Av)$ from the time
the trough and face f meet each other until the
trough and face f’ meet each other, face f travels
a distance $\Delta V/A$ with constant speed $v'$, from
which we have

$$v' = \frac{\Delta V/A}{V/(Av)} = v\,\Delta V/V.$$


By the time the trough and face f’ meet each
other, every particle of section ff’ has the same
speed $v'$. The change of kinetic energy of section
ff’ in acquiring the speed $v'$ from rest is $\tfrac{1}{2}\rho V(v')^2$,
which can be expressed in the form

$$\tfrac{1}{2}\rho V(v\,\Delta V/V)^2.$$

In acquiring the speed $v'$, section ff’ has expanded
from volume $V$ at pressure $p$ to volume $V+\Delta V$ at
pressure $p+\Delta p$. The change of its potential
energy, by definition, is the amount of work that
would be done on it in this change of state if
there were no accompanying change of kinetic
energy. In the case of all ordinary longitudinal
waves, $\Delta p$ is small in comparison with $p$ and $\Delta V$
small in comparison with $V$. Accordingly, the
average pressure during a change of volume from
$V$ to $V+\Delta V$ without change of kinetic energy
would be $p+\tfrac{1}{2}\Delta p$, whence the change of potential
energy of section ff’ in the expansion is

$$-(p+\tfrac{1}{2}\Delta p)\,\Delta V.$$

The total change of mechanical energy of section
ff’ from its initial state of rest to its state in the
trough is

$$\tfrac{1}{2}\rho V(v\,\Delta V/V)^2 - (p+\tfrac{1}{2}\Delta p)\,\Delta V.$$


Next, consider the work done on section ff’
accompanying this change of energy. Without
making any assumption concerning the shape of
the wave, let $\bar{p}$ denote the average pressure between
the trough and the front of the pulse. The
work done on ff’ is the sum of the separate
amounts of work done on the two faces as follows:
(i) a positive amount of work,

$$\bar{p}As,$$

done on face f’ by that part of the fluid to the
right of it in displacing face f’ a distance $s$ to the
left; (ii) a negative amount of work,

$$-\bar{p}As - (p+\Delta p)A \times \Delta V/A,$$

done on face f by that part of the fluid to the
left of it in letting face f move first through a
distance $s$ against an opposing force of average
value $\bar{p}A$ and then through a distance $\Delta V/A$
against an opposing force of constant value
$(p+\Delta p)A$. Adding the work done on faces f and
f’, the two terms containing $\bar{p}$ cancel each other
and the total work done on section ff’ in the
change under consideration reduces to $-(p+\Delta p)\,\Delta V$.

Setting the work done on section ff’ equal to
the corresponding change of its mechanical
energy, we have

$$-(p+\Delta p)\,\Delta V = -(p+\tfrac{1}{2}\Delta p)\,\Delta V + \tfrac{1}{2}\rho V(v\,\Delta V/V)^2. \tag{1}$$




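Combining the two terms involving pressure, we
have

$$-\tfrac{1}{2}\Delta p\,\Delta V = \tfrac{1}{2}\rho V(v\,\Delta V/V)^2, \tag{2}$$

and solving for $v$, get

$$v = \left[\frac{-V\,\Delta p/\Delta V}{\rho}\right]^{1/2},$$

$$v = \sqrt{B/\rho},$$

where $B$ denotes the bulk modulus, $-V\,\Delta p/\Delta V$.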
This is the result we set out to derive. However, 
it is only part of the story. We naturally want to 
get a clear picture of the energy transformations 
that occur in the transmission of longitudinal 
waves. 
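The energy bookkeeping behind Eqs. (1) and (2) can be tried out numerically. The following Python sketch evaluates the work, kinetic-energy, and potential-energy terms for a small rarefaction, using the definitions given above. The choice of air at atmospheric pressure with the adiabatic bulk modulus $B=\gamma p$, and the particular values of $\Delta p$ and $V$, are assumptions made only for this illustration; the function name `energy_terms` is likewise hypothetical.

```python
import math


def energy_terms(p, dp, V, rho, B):
    """Return (work, kinetic, potential) for a section of volume V.

    work      = -(p + dp) * dV          total work done on the section
    kinetic   = 1/2 rho V (v dV / V)^2  kinetic energy acquired
    potential = -(p + dp/2) * dV        change of potential energy
    with dV = -V dp / B and v = sqrt(B / rho).
    """
    dV = -V * dp / B
    v = math.sqrt(B / rho)
    kinetic = 0.5 * rho * V * (v * dV / V) ** 2
    potential = -(p + 0.5 * dp) * dV
    work = -(p + dp) * dV
    return work, kinetic, potential


# Assumed illustrative values for air (not from the paper).
p = 1.013e5        # undisturbed pressure in Pa
gamma = 1.4        # ratio of specific heats, used for the adiabatic modulus
B = gamma * p      # bulk modulus in Pa
rho = 1.2          # density in kg/m^3
V = 1.0e-6         # undisturbed volume of the section in m^3
dp = -10.0         # pressure change in the trough (negative: rarefaction)

work, kinetic, potential = energy_terms(p, dp, V, rho, B)
speed = math.sqrt(B / rho)

# Eq. (1): work done on the section = change of potential + kinetic energy.
print(speed, work, potential + kinetic)
```

For the rarefaction the work done on the section and the change of potential energy both come out negative while the kinetic energy is positive, matching the signs in Eq. (1).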

Transposing terms in Eq. (1) to get rid of the 
negative signs, we have 


$$(p+\tfrac{1}{2}\Delta p)\,\Delta V = (p+\Delta p)\,\Delta V + \tfrac{1}{2}\rho V(v\,\Delta V/V)^2. \tag{3}$$

The quantity $(p+\tfrac{1}{2}\Delta p)\,\Delta V$ is the decrease of
potential energy of section ff’. The quantity
$(p+\Delta p)\,\Delta V$ represents positive work done by
section ff’ on other parts of the fluid. (It will be
recalled that $\Delta p$ is negative, which makes
$p+\tfrac{1}{2}\Delta p$ greater than $p+\Delta p$.) Thus, Eq. (3)
states that the decrease of potential energy of 
section ff’ in the change from its initial state of 
rest to its state in the trough is equal to the 
kinetic energy it acquires plus the work it does 
on the rest of the fluid. 

Consider what happens to section ff’ while the 
pulse of rarefaction travels the rest of the way 
over it. By the time the tail end of the pulse is on 
the point of leaving face f’, section ff’ has been 
compressed to its original state and every particle 
of it has come to rest. The change of its volume
is $V-(V+\Delta V)$, which reduces to $-\Delta V$. (Notice
that $\Delta V$ still denotes a positive quantity, although
we are now dealing with a case of compression.)
The change of potential energy of section ff’ is

$$(p+\tfrac{1}{2}\Delta p)\,\Delta V,$$

a positive quantity, and the change of its kinetic
energy is

$$0 - \tfrac{1}{2}\rho V(v\,\Delta V/V)^2.$$

The work done on face f is

$$-\bar{p}As,$$

the work done on face f’ is

$$\bar{p}As + (p+\Delta p)\,\Delta V,$$

and the total work done on section ff’ is

$$(p+\Delta p)\,\Delta V,$$

a positive quantity. The equation is

$$(p+\Delta p)\,\Delta V = (p+\tfrac{1}{2}\Delta p)\,\Delta V - \tfrac{1}{2}\rho V(v\,\Delta V/V)^2, \tag{4}$$


which is the same as Eq. (1) multiplied through 
by minus one. The meaning of each term is the 
reverse of what it was before. According to Eq. 
(3), in the change of section ff’ from its state in 
the trough to its final state of rest, it loses all its 
kinetic energy and has positive work done on it
by other parts of the fluid on the one hand, and 


gains potential energy in equal amount on the 
other hand. 


(ii) Pulse of Condensation 


First, a brief review of what happens to any 
particular cross-sectional area or face of the fluid 
as a single pulse of condensation passes through it. 
As soon as the front of the pulse reaches it, it 
begins to move toward the right. By the time the 
crest has caught up with it, its acceleration has 
decreased to zero and its velocity toward the 
right has increased to a maximum. Then, as the 
crest moves beyond it, its speed decreases and it 
ends up at rest a short distance to the right of its 
initial position just as the tail end of the pulse 
leaves it. 

We shall use the same symbols as before with 
the difference that now $p+\Delta p$ denotes the pressure
at a crest and $\Delta p$ is positive, $\Delta V$ negative.
During the interval from the time the front of the
pulse reaches face f until the crest catches up with
face f’, the work done on face f is

$$\bar{p}As - (p+\Delta p)\,\Delta V,$$

and the work done on face f’ is

$$-\bar{p}As.$$


The expression for the total work done on section 
ff’ is the same as in the case where it moved from 
rest into the trough of a pulse of rarefaction and 
so are the expressions for the changes of potential 
and kinetic energy. Thus, we get Eq. (1) as 
before. However, Eq. (1) in this case describes 


a different situation in view of the fact that $\Delta V$ is
negative and $p+\Delta p$ greater than $p+\tfrac{1}{2}\Delta p$.
Putting it in words, it states that positive work 
is done by the rest of the fluid on section ff’ in 
giving it kinetic energy and in increasing its 
potential energy. 

Dealing with section ff’ as the pulse of conden- 
sation travels the rest of the way over it, Eq. (4) 
is obtained. In words, Eq. (4) in this case states 
that section ff’ loses potential energy and all its 


kinetic energy in doing positive work on the rest 
of the fluid. 


LONGITUDINAL WAVES THROUGH A UNIFORM 
ROD 


The equations for the preceding case are fitted 
to the case of longitudinal waves through a solid 
rod of uniform cross-sectional area $A$ and density $\rho$
by the following changes in symbols. Let $L$ denote
the undisturbed length of section ff’ and $L+\Delta L$
its length in the trough or at the crest of a pulse,
as the case may be. In the undisturbed state,
section ff’ is subject to a compressive force on
the faces f and f’ equal to the product of the
cross-sectional area $A$ and atmospheric pressure.
Let $F$ denote this product and $F+\Delta F$ the compressive
force on the faces in the trough or at the
crest of a pulse, as the case may be. Accordingly,
substituting in Eq. (2) $\Delta F/A$ in place of $\Delta p$,
$A\,\Delta L$ in place of $\Delta V$, $AL$ in place of $V$, and
$\Delta L/L$ in place of $\Delta V/V$, we get

$$-\tfrac{1}{2}\Delta F\,\Delta L = \tfrac{1}{2}\rho AL(v\,\Delta L/L)^2. \tag{5}$$

Solving for $v$, we get

$$v = \left[\frac{-\Delta F/A}{\rho\,\Delta L/L}\right]^{1/2},$$

$$v = \sqrt{Y/\rho},$$

where $Y$ denotes Young’s modulus of the
material, $-(\Delta F/A)/(\Delta L/L)$.
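The substitution into Eq. (5) can be followed numerically. In the Python sketch below, a small compression $\Delta L$ of a short section of a rod is chosen, the accompanying $\Delta F$ is obtained from the definition of Young's modulus, and Eq. (5) is then solved for $v$. The material constants are assumed round values for steel and the section dimensions are arbitrary; none of them is taken from the paper.

```python
import math

# Assumed illustrative values for a steel rod (not from the paper).
Y = 2.0e11     # Young's modulus in Pa
rho = 7.8e3    # density in kg/m^3
A = 1.0e-4     # cross-sectional area in m^2
L = 1.0e-3     # undisturbed length of section ff' in m

# Small change of length at a crest (negative: the section is compressed).
dL = -1.0e-9

# Change of compressive force from Y = -(dF / A) / (dL / L).
dF = -Y * A * dL / L

# Eq. (5): -1/2 dF dL = 1/2 rho A L (v dL / L)^2, solved for v.
v_from_eq5 = math.sqrt(-dF * dL / (rho * A * L)) * L / abs(dL)

# Closed form obtained in the text.
v_closed_form = math.sqrt(Y / rho)

print(v_from_eq5, v_closed_form)
```

Both routes give the same speed, roughly 5 km/s for the assumed constants, and the result does not depend on the particular $\Delta L$, $A$, or $L$ chosen.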


LONGITUDINAL WAVES ALONG A SPRING 


This case is the same as the preceding one 
except for two minor differences as follows. 
First, letting F denote the tension in the spring, 
$\Delta F$ and $\Delta L$ have the same sign, whereas they
have opposite signs in the case of the rod where
$F$ denotes a force of compression. Second, letting
$\sigma$ denote the mass per unit length of the spring at
tension $F$, the mass of a segment ff’ of the spring
is $\sigma L$ in place of $\rho AL$ in the case of the rod. Thus,
substituting $\sigma$ for $\rho A$ in Eq. (5) and removing the
minus sign from the left-hand side of the equation,
we have

$$\tfrac{1}{2}\Delta F\,\Delta L = \tfrac{1}{2}\sigma L(v\,\Delta L/L)^2,$$

from which we get

$$v = \left[\frac{L\,\Delta F/\Delta L}{\sigma}\right]^{1/2},$$

$$v = \sqrt{Lk/\sigma},$$

where $k$ denotes the constant of the spring,
$\Delta F/\Delta L$.
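One point worth illustrating is that the product $Lk$ in the result for the spring does not depend on how long a section is considered, since a section one $n$th as long as the whole spring is $n$ times as stiff. The Python sketch below uses assumed values for the stretched length, spring constant, and mass; they are for illustration only.

```python
import math


def spring_wave_speed(k, length, sigma):
    """Speed of longitudinal waves along a spring, v = sqrt(L * k / sigma)."""
    return math.sqrt(length * k / sigma)


# Assumed illustrative values (not from the paper).
total_length = 2.0   # stretched length of the whole spring in m
k_total = 5.0        # spring constant of the whole spring in N/m
mass = 0.5           # mass of the spring in kg
sigma = mass / total_length   # mass per unit length in kg/m

# Speed computed from the whole spring.
v_whole = spring_wave_speed(k_total, total_length, sigma)

# Speed computed from a section one tenth as long, which is ten times as stiff.
n = 10
v_section = spring_wave_speed(n * k_total, total_length / n, sigma)

# Time for a pulse to run the length of the spring; equals sqrt(mass / k_total).
transit_time = total_length / v_whole

print(v_whole, v_section, transit_time)
```

The two speeds agree, and the transit time reduces to the square root of the total mass divided by the spring constant of the whole spring.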


LONG GRAVITY WAVES THROUGH A SHALLOW 
CANAL 


A gravity wave means one produced through
action of the force of gravity alone. An example
of a long one is a wave in an incoming or an outgoing
tide. Think of a long shallow rectangular
canal of breadth $b$ containing a liquid of density
$\rho$ to a depth $h$ and imagine the walls and bottom
to be perfectly smooth and the liquid to be incompressible.
The problem is to find the speed $v$
of a gravity wave through the canal with a wavelength
long enough in comparison with the depth
that horizontal displacement of a point on any
particular cross-sectional area or face of the
liquid can be taken to be the same for all points
on the face. We shall describe what happens to an
almost infinitesimally short cross-section ff’ of the
liquid as a crest passes over it toward the right.
As shown in Fig. 5, let $L$ denote the length of
section ff’ at depth $h$, $L+\Delta L$ its length, and
$h+\Delta h$ its height at a crest; $\Delta h$ is positive and $\Delta L$
negative. Since its volume never changes, we
have

$$Lhb = (L+\Delta L)(h+\Delta h)b,$$

which can easily be reduced to

$$-\Delta L/L = \Delta h/(h+\Delta h). \tag{6}$$

Fig. 5. (a) Diagram showing the undisturbed position of
a very short cross section ff’ of liquid in a shallow canal at
the instant the front of a long gravity wave reaches face f.
(b) Diagram of section ff’ and the wave at the instant the
crest catches up with face f. (c) Diagram of section ff’ and
the wave at the instant the crest catches up with face f’.


The interval from the time the front of a wave 
reaches face f until it reaches face f’ is L/v. Like- 
wise, L/v is the interval from the time the crest 
catches up with face f until it catches up with 
face f’. It can be seen from Fig. 5 that during 
this interval, face f travels a distance $-\Delta L$ with
constant speed $v'$ given by

$$v' = -\Delta L/(L/v),$$

and substituting for $-\Delta L/L$ its value from Eq.
(6), we get

$$v' = v\,\Delta h/(h+\Delta h).$$


In the change of section ff’ from its undisturbed
state to its state at the crest, it acquires a speed
$v'$ from rest and its center of gravity is raised a
distance $\tfrac{1}{2}\Delta h$. Denoting its mass by $m$, equal to
$\rho Lhb$, the kinetic energy it acquires is $\tfrac{1}{2}m(v')^2$,
which can be written in the form

$$\tfrac{1}{2}mv^2(\Delta h)^2/(h+\Delta h)^2,$$

and the change of its potential energy is

$$\tfrac{1}{2}mg\,\Delta h.$$


The change of its mechanical energy is the sum 
of these two changes. 

To get the work done on section ff’ corre- 
sponding to this change of mechanical energy, we 
can take a short cut by analogy with the work 
done in the case of a pulse of condensation. It 
was seen there that two terms, $\bar{p}As$ and $-\bar{p}As$,
canceled each other with the result that the total
work reduced to a single expression, $-(p+\Delta p)\,\Delta V$;
what it means in essence is that the work
is independent of the shape of the wave. The
quantity in the present instance corresponding
to $-(p+\Delta p)\,\Delta V$ is the product of $-\Delta L$ and the
force on either face at the crest. The average
pressure over a face of section ff’ is the same as
the pressure at the center of the face, that is,
$\tfrac{1}{2}\rho g(h+\Delta h)$ at the crest, and the force on either
face at the crest is this pressure times the cross-sectional
area $b(h+\Delta h)$. Thus, the work done by
other parts of the liquid on section ff’ in the
change under consideration is

$$-\Delta L \times \tfrac{1}{2}\rho gb(h+\Delta h)^2.$$

Substituting for $-\Delta L$ the value $L\,\Delta h/(h+\Delta h)$
from Eq. (6), the expression for the work is

$$\tfrac{1}{2}\rho Lbg\,\Delta h(h+\Delta h),$$

and substituting $m/h$ for $\rho Lb$, it becomes

$$\tfrac{1}{2}mg\,\Delta h(1+\Delta h/h).$$


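Setting the work equal to the change of
mechanical energy, we have

$$\tfrac{1}{2}mg\,\Delta h\left(1+\frac{\Delta h}{h}\right) = \tfrac{1}{2}mg\,\Delta h + \frac{1}{2}\,\frac{mv^2(\Delta h)^2}{(h+\Delta h)^2}. \tag{7}$$

Combining the terms containing $g$ and canceling
out $\tfrac{1}{2}m(\Delta h)^2$, we have

$$\frac{g}{h} = \frac{v^2}{(h+\Delta h)^2},$$

and solving for $v$, get

$$v = \left\{gh\left[1+\frac{2\Delta h}{h}+\frac{(\Delta h)^2}{h^2}\right]\right\}^{1/2}.$$

If the amplitude $\Delta h$ is very small in comparison
with the depth $h$, $\Delta h/h$ and $(\Delta h)^2/h^2$ can be
neglected and the equation reduces to

$$v = \sqrt{gh}.$$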


This is the result we set out to derive. However, 
there are several interesting things to note con- 
cerning the motion. 

According to Eq. (7), as section ff’ is overtaken 
by the crest, positive work is done on it by the 
rest of the liquid in giving it kinetic energy and 
in increasing its potential energy. It is interesting 
to see how much of the work goes into kinetic 
and how much into potential energy. Substituting
$gh$ for $v^2$ in the expression for the change of
kinetic energy and evaluating the ratio of the
change of kinetic to the change of potential
energy, one finds that the ratio is

$$h\,\Delta h/(h+\Delta h)^2,$$

or approximately $\Delta h/h$. Thus, most of the work
done on section ff’ by other parts of the liquid 
goes into potential energy and an almost negli- 
gible fraction of it into kinetic energy. As the 
wave moves the rest of the way over it, section 
ff’ loses all the potential and kinetic energy it 
gained in the first part of the motion and does an 
equal amount of positive work on the rest of the 
liquid. 
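The relations obtained for the canal lend themselves to a short numerical check. The Python sketch below evaluates the finite-amplitude speed, confirms the energy balance of Eq. (7), and computes the ratio of kinetic to potential energy in the form quoted above. The depth, amplitude, and mass of the section are assumed values chosen only for illustration.

```python
import math

g = 9.81  # acceleration of gravity in m/s^2


def canal_wave_speed(h, dh=0.0):
    """Speed from g/h = v^2/(h + dh)^2; reduces to sqrt(g*h) when dh is zero."""
    return (h + dh) * math.sqrt(g / h)


# Assumed illustrative values (not from the paper).
h = 2.0      # undisturbed depth in m
dh = 0.02    # rise of the surface at the crest in m
m = 1.0      # mass of section ff' in kg

v = canal_wave_speed(h, dh)
v_small_amplitude = canal_wave_speed(h)

# Terms of Eq. (7) for the section as the crest overtakes it.
work = 0.5 * m * g * dh * (1.0 + dh / h)
kinetic = 0.5 * m * v**2 * dh**2 / (h + dh) ** 2
potential = 0.5 * m * g * dh

# Ratio of kinetic to potential energy with g*h substituted for v^2,
# as in the text; it is close to dh / h.
ratio = h * dh / (h + dh) ** 2

print(v, v_small_amplitude, work, kinetic + potential, ratio)
```

With $\Delta h/h$ equal to one percent, the kinetic energy is about one percent of the potential energy, in line with the remark that nearly all the work goes into potential energy.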

Equation (7) also describes what happens to 
section ff’ as a trough passes over it; in this 
case, $\Delta h$ is negative and $\Delta L$ positive. As the
trough overtakes it, it loses a certain amount of 
potential energy on the one hand, gains kinetic 
energy and does positive work on the rest of the 
liquid in equal amount on the other hand. Then, 
as the trough passes the rest of the way over it, 
all the potential energy it lost in the first part of 
the motion is regained as a result of losing its 
kinetic energy and having positive work done on 
it by the rest of the liquid. 


COMPARISON WITH PREVIOUSLY PUBLISHED 
DERIVATIONS 


Elementary derivations of the equations for 
wave speeds in the same cases treated in this 
paper have previously been presented in this 
journal by S. Millman and M. W. Zemansky.¹ In
the case of transverse waves, they dealt only with
the special case of a “triangular wavefront.” It is
also an idealized case since the triangular shape
means that each particle of the transmitting
medium, when the wavefront reaches it, is instantaneously
given a constant speed which we
shall denote by $v''$. They applied Newton’s
second law in the form: resultant force equals
mass times acceleration. The segment to which
they applied it is a segment of increasing length
and therefore of increasing mass to which this
form of the law does not apply. Instead, the
resultant force on the segment has to be set equal
to its rate of change of momentum, $v''\,dm/dt$. The
law can be correctly applied by substituting $v\sigma$
for $dm/dt$ and $Fv''/v$ for the resultant force in the
case of a wire, $v\rho A$ for $dm/dt$ and $SAv''/v$ for the
resultant force in the case of a rod. Their treatment
of longitudinal waves holds only for a
“square wavefront.” In the case of long gravity
waves, they took the average pressure on a face
of the liquid to be constant, $\tfrac{1}{2}\rho gh$. On that basis,
the work done on section ff’ as a crest overtakes
it would be

$$\tfrac{1}{2}\rho ghb(h+\Delta h) \times \frac{L\,\Delta h}{h+\Delta h},$$

which reduces to $\tfrac{1}{2}mg\,\Delta h$ and would be just equal
to the increase of potential energy, leaving nothing
at all for kinetic energy. Obviously, the
problem can not be treated correctly without
taking into account changes of pressure.

¹ S. Millman and M. W. Zemansky, Am. J. Phys. 13, 250 (1945).

A number of wrong derivations of the equation
for the speed of longitudinal waves have been
pointed out by W. W. Sleator.² Unfortunately,
he made a slip that should not find its way into
our teaching. In the third section of his paper
under the heading “A New Proof,” the proof is
directed toward establishing the following statement:
“The basic fact is that in plane progressive
sound waves the potential and kinetic energies
per unit volume are everywhere equal at all
times.” This statement is immediately refuted by
the fact that the changes of potential and kinetic
energy of each tiny element of a medium in transmitting
a pulse of rarefaction are always of
opposite sign. Even in the case of a pulse of
condensation where these changes have the same
sign, they can never be equal. The ratio of the
change of kinetic to change of potential energy is

$$\frac{\tfrac{1}{2}\rho V(v\,\Delta V/V)^2}{-(p+\tfrac{1}{2}\Delta p)\,\Delta V},$$

and substituting $-V\,\Delta p/(\rho\,\Delta V)$ for $v^2$, the ratio
reduces to $\tfrac{1}{2}\Delta p/(p+\tfrac{1}{2}\Delta p)$, or approximately
$\tfrac{1}{2}\Delta p/p$. Thus, the change of kinetic energy is
almost negligible in comparison with the change
of potential energy. In terms of the present
notation, Sleator’s error lies in taking the
quantity on the left-hand side of Eq. (2),
namely $-\tfrac{1}{2}\Delta p\,\Delta V$, to be the change of potential
energy, which is just not so. Then, in example (3)
of section 4 under the heading “Four Correct
Versions,” he derives the equation for the speed
very simply by starting with Eq. (2) of the present
paper, which is begging the question since
the basis on which he sets up Eq. (2) is invalid.

² W. W. Sleator, Am. J. Phys. 17, 51 (1949).
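To give a sense of the size of the ratio discussed above, the following Python sketch evaluates $\tfrac{1}{2}\Delta p/(p+\tfrac{1}{2}\Delta p)$ and compares it with the ratio formed directly from the kinetic and potential energy expressions of this paper. The pressure, density, and bulk modulus are assumed values for air, and the function names are hypothetical; the sketch only illustrates the formulas as defined here.

```python
import math


def kinetic_to_potential_ratio(p, dp):
    """Closed form of the ratio: (dp / 2) / (p + dp / 2)."""
    return 0.5 * dp / (p + 0.5 * dp)


def ratio_from_energies(p, dp, V, rho, B):
    """Same ratio from the energy expressions, with dV = -V dp / B, v^2 = B / rho."""
    dV = -V * dp / B
    v_squared = B / rho
    kinetic = 0.5 * rho * V * v_squared * (dV / V) ** 2
    potential = -(p + 0.5 * dp) * dV
    return kinetic / potential


# Assumed illustrative values for air (not from the paper).
p = 1.013e5      # undisturbed pressure in Pa
B = 1.4 * p      # adiabatic bulk modulus in Pa
rho = 1.2        # density in kg/m^3
V = 1.0e-6       # volume of the section in m^3

r_condensation = kinetic_to_potential_ratio(p, 10.0)    # dp positive
r_rarefaction = kinetic_to_potential_ratio(p, -10.0)    # dp negative
r_direct = ratio_from_energies(p, 10.0, V, rho, B)

print(r_condensation, r_rarefaction, r_direct)
```

For a pressure change of 10 Pa in air at atmospheric pressure the ratio is of the order of 5 × 10⁻⁵ in magnitude, positive for a condensation and negative for a rarefaction.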


Western Michigan College and Kalamazoo College 


An Evening of Physics—consisting of coffee hour, 
dinner, physics exhibit, and talk on transistor physics— 
was held March 22 on the Kalamazoo College campus. It 
was sponsored jointly by Western Michigan College and 
Kalamazoo College, and it was attended by physics faculty 
and major students from six other colleges and junior 
colleges and by physics teachers from about ten south- 
western Michigan high schools. 

About 75 attended the dinner in the college dining hall. 
The physics exhibit, following the dinner, was partly 
modeled after the very successful exhibit of novel devices 
held in conjunction with G. W. Stewart’s University of 
Iowa Colloquium of College Physicists every June. How- 
ever, in this case, the competition was between under- 
graduate college students at the various schools instead of 
between the teachers. Six devices were entered, and through 
the courtesy of the Chemical Rubber Publishing Company,
Cleveland 14, Ohio, two copies of the Handbook of
Chemistry and Physics were awarded as prizes. The two
winners this year were Richard Doolittle, a student at
Western Michigan College, for his model Van de Graaff
generator, and Bruce Van Domelen, a student at Kala- 
mazoo College, for his Geiger counter, power supply, and 
scaler. Harold Gaut, a Western Michigan student, won 
honorable mention with his longitudinal wave generator. 

The speaker of the evening was Dr. John A. Hornbeck 
of Bell Telephone Laboratories, who gave a demonstration 
lecture on “Transistor Physics.” He gave a very careful,
detailed explanation of how the junction transistor works, 
and illustrated his talk by a number of demonstrations 
and slides. About 125 attended the talk. 


GEORGE E. BRADLEY, Western Michigan College
RALPH O. KERMAN, Kalamazoo College





An Interpretation of the Variable Mass in Wessel’s Theory of the Electron 


S. J. Czyzak * 
Aeronautical Research Laboratory, Wright Air Development Center, Air Research and Development Command 
Wright-Patterson Air Force Base, Ohio 


(Received August 10, 1953) 


The work of Dirac, Bopp, Hönl, and Wessel on the theory of the electron is briefly reviewed.
The equations of motion, with a variable rest mass, are integrated to obtain the velocities. The
initial condition in this development can be so chosen that one obtains a motion which is
analogous to the “zitterbewegung” first derived by Schrödinger. The connection between spin
and radiation reaction force is clarified.


I. INTRODUCTION 


TO understand the nature of matter has
been the major pursuit of physicists and
philosophers. It is believed that matter is built
up from a few elementary particles: the proton
and electron which have been known for quite
some time; the neutron, positron, and mesons
which have been conclusively proven by experiment;
and still others, which have been postulated
to satisfy theory, but have not been experimentally
verified.

Both classical and quantum electrodynamics
break down below a certain limit represented by
the elementary length $e^2/m_0c^2$. Electrostatics, on
the other hand, runs into the difficulty of an
infinite self-energy for point-like charges, which
is meaningless. If, however, the electron is
assumed to be spread through a small volume, as
Lorentz originally suggested,¹ then difficulties
arise when the acceleration of the electron is
treated relativistically.

With the development of quantum mechanics 
it was hoped that the foregoing difficulties would 
be eliminated. However, new difficulties arose 
instead. The models for which the electron was 
spread over a small volume were too difficult to 
quantize and the point-like charge models which 
lead to infinities in classical theory were equally 
difficult in quantum theory. 

If Dirac’s original theory is reviewed,² it will
be observed that he retains Maxwell’s theory to 
describe the field including the point singularity. 
The terms which give rise to the infinities are 


* On leave of absence from University of Detroit, Detroit, Michigan.
1 H. A. Lorentz, The Theory of the Electron (Leipzig, 1916), second edition.
2 P. A. M. Dirac, Proc. Roy. Soc. (London) A167, 148 (1938).


subtracted out, and this is possible because of the mathematical formalism that is used.
The essential difference between Dirac’s and Lorentz’s theory is that the former appears
to be exact within the limits of classical theory whereas the latter is only an
approximation. Work by Lamb and Retherford,³ Schwinger,⁴ Feynman,⁵ Bopp,⁶ and others did
much to eliminate the difficulties encountered in this problem of electrodynamics. Still,
as Dirac⁷ points out, the resulting theory leaves much to be desired. An excellent review
of the interaction of electrons and the electromagnetic field has been written by
Eliezer.⁸ In passing it should be mentioned that since Eliezer’s publication Bopp has
developed an excellent theory of linear electrodynamics without infinities in quantum
mechanics which is very attractive even though the quantization limits it to integral
spins. Recently, Dirac⁷ has developed a new classical theory of electrons; however,
considerable work will be necessary before its significance can be determined.

When Dirac⁹ developed his original theory of the electron and positron, he did so without
reference to any specific model, i.e., the theory was based solely on quantum-mechanical
considerations. In recent years Bopp,⁶ Hönl,¹⁰ and


3 W. E. Lamb and R. C. Retherford, Phys. Rev. 72, 241 (1947); 79, 549 (1950); 81, 222 (1951).
4 J. Schwinger, Phys. Rev. 74, 1439 (1948); 75, 651 (1949); 76, 790 (1949).
5 R. P. Feynman, Phys. Rev. 74, 939, 1430 (1948).
6 F. Bopp, Z. Naturforsch. 1, 196 (1946); 3a, 564 (1948); Z. Physik 125, 615 (1949). A review
article on this theory is given by H. Hönl in Ergeb. exakt. Naturw. 26, 291 (1952).
7 P. A. M. Dirac, Proc. Roy. Soc. (London) A209, 291 (1951).
8 C. J. Eliezer, Revs. Modern Phys. 19, 147 (1947).
9 P. A. M. Dirac, Proc. Roy. Soc. (London) A117, 610 (1928); A126, 360 (1931).
10 H. Hönl, Z. Naturforsch. 3a, 573 (1948).




Wessel¹¹ have derived Dirac’s theory of the electron on a classical basis; the essential
difference is in the mathematical formulation, which leads to the replacement of the
commutation rules by Poisson brackets. The two independent developments of Bopp and
Wessel are very attractive and justify further inquiry.


II. PRELIMINARY CONSIDERATION


Essentially, the developments of Bopp, Hönl,
and Wessel stem from the desire to eliminate 
the field quantization difficulties by considering 
certain factors overlooked in the older electron 
theory. 

A clear fundamental approach to the interaction of the electron with its own field was
developed by Bopp.⁶ He showed that when the development of the Lagrangian in powers of
the retardation parameter is carried one step further, an acceleration-dependent rest
mass and higher time derivatives $(\ddot{x}_j, \dddot{x}_j, \ddddot{x}_j, \cdots)$ in the
equation of motion are obtained. In his development the nonconservative part of the
radiation reaction force is neglected. The motion is then conservative and can be derived
from Born’s variation principle

$$\delta \int L(x, \dot{x}, \ddot{x})\, dt = 0,$$

which leads to the fourth-order equation

$$\frac{d^2}{dt^2}\frac{\partial L}{\partial \ddot{x}} - \frac{d}{dt}\frac{\partial L}{\partial \dot{x}} + \frac{\partial L}{\partial x} = 0,$$

from which the Poisson brackets are obtained directly.

Hönl has approached the problem from a very novel standpoint in that he describes the
spinning electron by the superposition of a (positive) mass and a (positive-negative)
mass dipole. The mass dipole supplies the angular momentum. Bopp shows that Hönl’s
theory coincides with his own to a first approximation. With the mass dipole
interpretation the charge does not coincide with the center of gravity. This is explained
by the fact that in the emission and reabsorption processes the energy fluctuates about
the charge.

11 W. Wessel, Ann. Physik 43, 565 (1943); Z. Naturforsch. 1, 622 (1946); 6a, 473, 478 (1951);
7a, 583 (1952); Phys. Rev. 76, 1512 (1949).




Further, those fluctuations will occur when the 
total charge is zero, so that spin exists for non- 
charged particles also. 

Wessel’s approach to the problem of the theory 
of the electron has been to start with the non- 
conservative part of the radiation reaction force. 
The fact that the equation of motion is then of 
the third order leads to the independence of 
velocity and momentum, which is so character- 
istic for Dirac’s theory. In order to obtain the 
equation of motion through Poisson brackets 
from a Hamiltonian, an acceleration-dependent 
rest mass and a magnetic moment as an inde- 
pendent variable were required. Having accom- 
plished this, the theory was readily quantized by 
reinterpreting the Poisson brackets as closed 
commutator algebra. The Poisson brackets were 
first proposed by Wessel on a heuristic basis. 
More recently Wessel and Czyzak¹² have given
a derivation of the Poisson brackets based on a
few plausible assumptions. In the absence of the 
Lorentz force, the motion under the (third-order) 
radiation reaction force is contained as a partic- 
ular integral in the (fourth-order) equation of 
motion, which is coincident for integral spin, 
with the equation of motion of Bopp. In the 
presence of the Lorentz force a slight alteration 
of the Hamiltonian is necessary. But both the 
classical and quantum-mechanical treatments 
appear to require the same modification of the 
Hamiltonian. 

While the developments of Bopp and Hönl are
very attractive, certain serious drawbacks do
exist. Namely, Bopp and Hönl are unable to
account for the Lamb shift or the half-integral 
spin, and further, the nonconservative radiation 
reaction force is neglected. In Wessel’s theory all 
three are included, but quantization leads to 
wrong numerical factors, which seem to indicate 
that the Hamiltonian has still to be refined. 


III. THE VARIABLE MASS 


In classical theory the equation of motion for a free electron, when the mass is
constant, is

$$m u_j' = \frac{2e^2}{3c^2}\{u_j'' - u_j (u_i' u'^i)\}, \tag{1}$$


12 W. Wessel and S. J. Czyzak, Phys. Rev. 91, 986 (1953). 







where (′) means differentiation along the world line. The general solution of this
equation is of the following form:

$$u_j = A_j \exp(C \exp[3mc^2 s/2e^2]) + B_j \exp(-C \exp[3mc^2 s/2e^2]), \tag{2}$$

where $s$ is the length of the world line and $A_j$, $B_j$, and $C$ are arbitrary
constants subject only to the conditions

$$A_j A^j = B_j B^j = 0, \tag{3}$$

and

$$A_j B^j = -\tfrac{1}{2}. \tag{4}$$

This, then, gives the motion for the electron in a plane with its velocity approaching
that of light. In this type of motion the electron loses energy at an extremely rapid
rate, a fact which has not been observed in nature. Thus, with this solution an unreal
result is obtained. If, however, one chooses a particular solution in which the
conditions are $u_j = C_j$ and $C_j C^j = -1$, then no loss of energy is obtained and the
velocity remains constant. This type of behavior of the electron has been observed and
therefore the motion is real.
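As a check on the general solution, it can be substituted back into the constant-mass equation of motion. The sketch below assumes the coefficient of the radiation reaction term is $2e^2/3c^2$, so that $k = 3mc^2/2e^2$, and it assumes $A_jB^j = -\tfrac12$, which is what $u_ju^j = -1$ requires once $A_jA^j = B_jB^j = 0$. The abbreviation $X(s) = Ce^{ks}$ is introduced here for convenience and is not part of the original notation.

```latex
\begin{align*}
u_j &= A_j e^{X} + B_j e^{-X}, \qquad X = C e^{ks}, \qquad X' = kX, \\
u_j' &= kX\,(A_j e^{X} - B_j e^{-X}), \\
u_i' u'^i &= k^2 X^2\,(-2 A_i B^i) = k^2 X^2, \\
u_j'' &= k\,u_j' + k^2 X^2\,u_j, \\
\frac{m}{k}\,\{u_j'' - u_j\,(u_i' u'^i)\} &= \frac{m}{k}\,\{k\,u_j' + k^2X^2 u_j - k^2X^2 u_j\} = m\,u_j'.
\end{align*}
```

Since $u_i'u'^i = k^2C^2e^{2ks}$ grows exponentially with $s$, the radiated energy in this solution grows without bound, which is the unphysical behavior described above.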

In the foregoing equation of motion, the mass 
has been considered constant and thus only the 
velocity is the important variable in the equation 
of motion. If, on the other hand, one were to 
consider the independence of momentum and 
velocity of Dirac’s theory explained classically 
as a consequence of the radiation reaction force 
by Wessel, then a much more interesting result 
is obtained. For now, the mass must be con- 
sidered as a variable. 

The equation for the variable mass postulated by Wessel¹¹ is

$$m' = \frac{2e^2}{3c^2}(u_i' u'^i). \tag{5}$$

This implies that the energy radiated by the moving charge is added to its rest mass,
which may be interpreted in the sense of Sec. II of a previous paper.¹²

Otherwise, the equation of motion for the free electron in the absence of the Lorentz
force is Eq. (1), as before. The following identities must be satisfied:

$$u_j u^j = -1, \qquad u_j u'^j = 0,$$

and

$$u_j u''^j + u_j' u'^j = 0. \tag{6}$$


When Eq. (5) is substituted into Eq. (1), we get

$$m' u_j + m u_j' = \frac{d}{ds}(m u_j) = \frac{2e^2}{3c^2} u_j'', \tag{7}$$

where the symbol $s$ means here and in the following the length of the world line.
Integration gives

$$m u_j = \frac{2e^2}{3c^2} u_j' + \frac{p_j}{c}, \tag{8}$$

where $p_j/c$ is an arbitrary constant 4-vector. Thus, we have regained Eq. (1.9) of a
foregoing paper¹² in which the $p_j$ has the nature of momentum.

If Eq. (8) is multiplied by $u^j$ and the first two identities of Eq. (6) are used, we get

$$p_j u^j = -mc. \tag{9}$$

Solve Eq. (8) for $u_j'$ and square it. Also use Eqs. (5) and (9) to get the solution for
$m'$, which becomes

$$m' = \frac{3c^2}{2e^2}(m^2 + p^2/c^2). \tag{10}$$
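The intermediate algebra can be written out as follows. This is a sketch under the assumptions that the integrated equation of motion reads $mu_j = (2e^2/3c^2)\,u_j' + p_j/c$, that $p^2$ stands for $p_jp^j$, and that $u_ju^j = -1$ and $p_ju^j = -mc$ hold as stated in the text.

```latex
\begin{align*}
u_j' &= \frac{3c^2}{2e^2}\left(m\,u_j - \frac{p_j}{c}\right), \\
u_j' u'^j &= \left(\frac{3c^2}{2e^2}\right)^{2}
   \left(m^2\,u_j u^j - \frac{2m}{c}\,p_j u^j + \frac{p^2}{c^2}\right)
   = \left(\frac{3c^2}{2e^2}\right)^{2}\left(-m^2 + 2m^2 + \frac{p^2}{c^2}\right), \\
m' &= \frac{2e^2}{3c^2}\,u_j' u'^j
   = \frac{3c^2}{2e^2}\left(m^2 + \frac{p^2}{c^2}\right).
\end{align*}
```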


Equation (10) can be integrated; however, before this is done it should be pointed out
that $p^2/c^2$ can be taken to be either positive or negative. In the former choice the
interpretation of the $p_j$ is that it is space-like, whereas in the latter it is
time-like. Regardless of which choice is made, the $m'$ must always be positive, with
regard to Eq. (5), because $u_j$ is time-like and $u_j'$ is space-like. So, when $m'$ is
integrated, two solutions are possible depending upon the sign of the $p^2/c^2$.

For the case where $p_j$ is chosen to be space-like, integration of Eq. (10) yields

$$m = m_0 \tan(\Gamma s/\lambda), \tag{11}$$

where $p^2/c^2 = m_0^2$. When $p_j$ is chosen to be time-like, the solution is

$$m = -m_0 \operatorname{ctnh}(\Gamma s/\lambda), \tag{12}$$

where

$$p^2/c^2 = -m_0^2, \quad \text{and} \quad \frac{\lambda}{\Gamma} = \frac{2e^2}{3m_0c^2}.$$
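The two closed-form mass solutions can be checked numerically. The sketch below assumes the dimensionless variables mu = m/m0 and theta = Gamma*s/lambda, in which the mass equation reduces to d(mu)/d(theta) = mu**2 + 1 for space-like p and mu**2 - 1 for time-like p. The function name `integrate_mass` and the Runge-Kutta step count are illustrative choices, not part of the original paper.

```python
import math

def integrate_mass(sign, mu_start, theta_start, theta_end, steps=2000):
    """Integrate d(mu)/d(theta) = mu**2 + sign with classical RK4.

    sign = +1 corresponds to space-like p, sign = -1 to time-like p.
    mu is m/m0 and theta is Gamma*s/lambda (both dimensionless).
    """
    def rhs(mu):
        return mu * mu + sign

    h = (theta_end - theta_start) / steps
    mu = mu_start
    for _ in range(steps):
        k1 = rhs(mu)
        k2 = rhs(mu + 0.5 * h * k1)
        k3 = rhs(mu + 0.5 * h * k2)
        k4 = rhs(mu + h * k3)
        mu += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return mu

# Space-like case: start at mu = 0 and compare with tan(theta).
mu_space = integrate_mass(+1, 0.0, 0.0, 1.0)
# Time-like case: start on the curve -coth(theta) and compare further along it.
mu_time = integrate_mass(-1, -1.0 / math.tanh(0.5), 0.5, 1.5)

print(mu_space, math.tan(1.0))
print(mu_time, -1.0 / math.tanh(1.5))
```

The space-like branch diverges as theta approaches pi/2, so the integration interval is kept well inside that limit.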







If Eqs. (11) or (12) are substituted into Eq. (8) and integrated, the result is

$$u_j = c_j \sec(\Gamma s/\lambda) - (p_j/m_0c) \tan(\Gamma s/\lambda), \tag{13}$$

and

$$u_j = c_j \operatorname{csch}(\Gamma s/\lambda) - (p_j/m_0c) \operatorname{ctnh}(\Gamma s/\lambda), \tag{14}$$

where $c_j$ represents a second set of integration constants. To fulfill the condition
$u_j u^j = -1$ in Eqs. (13) and (14), it is necessary that $c_j c^j = -1$ and
$c_j p^j = 0$ for Eq. (13), whereas $c_j c^j = 1$ and $c_j p^j = 0$ for Eq. (14). It can
be seen that when $p_j$ is space-like then $c_j$ is time-like and vice versa. For $c_j$
time-like we write $c_j = u_j^0$, and for $c_j$ space-like $c_j = U_j^0$.
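The normalization conditions can be verified with sample vectors. This sketch assumes a metric of signature (+, +, +, -), with the fourth component time-like, which matches the relation u_j u^j = u^2 - (u^4)^2 used later in the paper. The sample vectors and the helper names are illustrative assumptions; q denotes p/(m0 c).

```python
import math

def minkowski(a, b):
    """Inner product with three space components followed by the time component."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2] - a[3] * b[3]

def u_spacelike_p(c, q, theta):
    """u_j = c_j sec(theta) - q_j tan(theta), for c time-like and q space-like."""
    return [ci / math.cos(theta) - qi * math.tan(theta) for ci, qi in zip(c, q)]

def u_timelike_p(c, q, theta):
    """u_j = c_j csch(theta) - q_j ctnh(theta), for c space-like and q time-like."""
    return [ci / math.sinh(theta) - qi / math.tanh(theta) for ci, qi in zip(c, q)]

# Sample constants: TIMELIKE has product -1, SPACELIKE has product +1,
# and the two are orthogonal in the Minkowski sense.
TIMELIKE = [0.75, 0.0, 0.0, 1.25]
SPACELIKE = [1.25, 0.0, 0.0, 0.75]

for theta in (0.3, 1.0, 1.4):
    u = u_spacelike_p(TIMELIKE, SPACELIKE, theta)
    print("space-like p:", theta, minkowski(u, u))

for theta in (0.3, 1.0, 2.0):
    u = u_timelike_p(SPACELIKE, TIMELIKE, theta)
    print("time-like p:", theta, minkowski(u, u))
```

In both cases the product stays at -1 for every theta because sec^2 - tan^2 = 1 and ctnh^2 - csch^2 = 1.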

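Going over to the ordinary velocity, Eqs. (13) and (14) become

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{u}}{u^4} = \frac{\mathbf{u}_0 - (\mathbf{p}/m_0c)\sin(\Gamma s/\lambda)}{u_0^4 - (p^4/m_0c)\sin(\Gamma s/\lambda)} \quad (p_j\ \text{space-like}), \tag{15}$$

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{u}}{u^4} = \frac{\mathbf{U}_0 - (\mathbf{p}/m_0c)\cosh(\Gamma s/\lambda)}{U_0^4 - (p^4/m_0c)\cosh(\Gamma s/\lambda)} \quad (p_j\ \text{time-like}). \tag{16}$$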


IV. DISCUSSION 


Formulas (15) and (16) represent the general solution for the motion of the electron for
$p_j$ space-like and time-like. Not all the vector components that appear in these
equations can be chosen freely. To evaluate these equations it is necessary to consider
$p_j$ in terms of ordinary vectors.

For $p_j$ space-like the following relations hold:

$$p_j p^j = (m_0c)^2 = \mathbf{p}^2 - (p^4)^2;$$

then

$$p^4 = \{\mathbf{p}^2 - (m_0c)^2\}^{1/2}, \tag{17}$$

and

$$u_j^0 u_0^j = -1 = \mathbf{u}_0^2 - (u_0^4)^2;$$

or

$$u_0^4 = (1 + \mathbf{u}_0^2)^{1/2}.$$

Since the condition $c_j p^j = 0$ demands that

$$u_j^0 p^j = \mathbf{u}_0 \cdot \mathbf{p} - u_0^4 p^4 = 0,$$

then

$$u_0^4 = \frac{\mathbf{u}_0 \cdot \mathbf{p}}{(\mathbf{p}^2 - (m_0c)^2)^{1/2}},$$

and therefore,

$$\mathbf{u}_0 \cdot \mathbf{p} = (1 + \mathbf{u}_0^2)^{1/2}(\mathbf{p}^2 - m_0^2c^2)^{1/2}. \tag{18}$$




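If we let $\mathbf{p} = \mathbf{z}\, m_0c$, where $\mathbf{z}$ is a dimensionless vector,
Eq. (18) becomes:

$$\mathbf{z} \cdot \mathbf{u}_0 = (z^2 - 1)^{1/2}(1 + u_0^2)^{1/2},$$

$$|\mathbf{z}|\,|\mathbf{u}_0| \cos\alpha = (z^2 - 1)^{1/2}(1 + u_0^2)^{1/2}, \tag{19}$$

where $\alpha$ is the angle between $\mathbf{z}$ and $\mathbf{u}_0$.
Similarly, for $p_j$ time-like, we obtain

$$p^4 = (\mathbf{p}^2 + (m_0c)^2)^{1/2}, \tag{20}$$

$$\mathbf{U}_0 \cdot \mathbf{p} = (U_0^2 - 1)^{1/2}(\mathbf{p}^2 + m_0^2c^2)^{1/2}, \tag{21}$$

$$\mathbf{z} \cdot \mathbf{U}_0 = (z^2 + 1)^{1/2}(U_0^2 - 1)^{1/2},$$

$$|\mathbf{z}|\,|\mathbf{U}_0| \cos\alpha = (z^2 + 1)^{1/2}(U_0^2 - 1)^{1/2}. \tag{22}$$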


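Equations (15) and (16) may now be written in the following form:

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{u}_0 - \mathbf{z}\sin(\Gamma s/\lambda)}{(1 + u_0^2)^{1/2} - (z^2 - 1)^{1/2}\sin(\Gamma s/\lambda)}, \tag{23}$$

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{U}_0 - \mathbf{z}\cosh(\Gamma s/\lambda)}{(U_0^2 - 1)^{1/2} - (z^2 + 1)^{1/2}\cosh(\Gamma s/\lambda)}. \tag{24}$$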


Since the general formulas do not lend themselves to ready interpretation, let us
consider the solution for $\cos\alpha = 1$ and $\cos\alpha = 0$. If Eq. (8) is
re-examined, it will be observed that the choice of the direction of the constant of
integration, $p_j/c$, with respect to $u_j$ determines the motion. Choosing $p_j/c$
parallel to $u_j$, in which case $u_j'$ is also in the direction of $u_j$, produces
motion in a straight line. In this case we have the condition $\cos\alpha = 1$. As a
consequence of Eqs. (17), (18), (20), and (21), the following relationships are obtained:

$$\begin{array}{ll}
p_j\ \text{space-like} & p_j\ \text{time-like} \\
u_0 = (z^2 - 1)^{1/2} & U_0 = (z^2 + 1)^{1/2} \\
u_0^4 = z & U_0^4 = z \\
p^4/m_0c = u_0 & p^4/m_0c = U_0
\end{array}$$

Our two equations, (15) and (16), now yield, respectively:

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{e}\,((z^2 - 1)^{1/2} - z\sin(\Gamma s/\lambda))}{z - (z^2 - 1)^{1/2}\sin(\Gamma s/\lambda)}, \tag{25}$$

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{e}\,((z^2 + 1)^{1/2} - z\cosh(\Gamma s/\lambda))}{z - (z^2 + 1)^{1/2}\cosh(\Gamma s/\lambda)}, \tag{26}$$







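where $\mathbf{e}$ is a unit vector in the direction of $\mathbf{u}_0$ and $\mathbf{z}$.

When the initial condition in Eq. (8) for $p_j/c$ is so chosen that $u_j'$ is not in the
direction of $u_j$, then the motion is not in a straight line. For instance, the $p_j/c$
may be chosen orthogonal to $u_j$, and we have the condition $\cos\alpha = 0$. For
$\cos\alpha = 0$ the relationships are:

$$\begin{array}{ll}
p_j\ \text{space-like} & p_j\ \text{time-like} \\
z = 1 & U_0 = 1 \\
\mathbf{z} = \mathbf{e} & \mathbf{U}_0 = \mathbf{e} \\
p^4 = 0 & U_0^4 = 0 \\
 & p^4/m_0c = (z^2 + 1)^{1/2}
\end{array}$$

For this particular case Eqs. (15) and (16) become

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{u}_0}{u_0^4} - \frac{\mathbf{e}}{u_0^4}\sin(\Gamma s/\lambda), \tag{27}$$

$$\frac{\mathbf{v}}{c} = \frac{\mathbf{z}}{(z^2 + 1)^{1/2}} - \frac{\mathbf{e}}{(z^2 + 1)^{1/2}}\operatorname{sech}(\Gamma s/\lambda), \tag{28}$$

respectively.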
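The two orthogonal-case velocities can be evaluated numerically to see the oscillating and the decaying behavior. The sketch assumes a coordinate choice that is not in the original: the constant vector (u0 or z) lies along x and the unit vector e along y, so that they are orthogonal as cos(alpha) = 0 requires. Function names are illustrative, and theta stands for Gamma*s/lambda.

```python
import math

def velocity_spacelike(u0, theta):
    """v/c for space-like p: constant part u0/u0_4 along x, oscillating part along y."""
    u04 = math.sqrt(1.0 + u0 * u0)      # fourth component of the initial 4-velocity
    return (u0 / u04, -math.sin(theta) / u04, 0.0)

def velocity_timelike(z, theta):
    """v/c for time-like p: constant part along x, sech-decaying part along y."""
    norm = math.sqrt(z * z + 1.0)
    return (z / norm, -1.0 / (math.cosh(theta) * norm), 0.0)

def speed(v):
    """Magnitude of a 3-vector."""
    return math.sqrt(sum(component * component for component in v))

for theta in (0.0, 0.5, 1.0, 1.5, 3.0):
    vs = velocity_spacelike(0.5, theta)
    vt = velocity_timelike(0.5, theta)
    print(theta, round(speed(vs), 6), round(speed(vt), 6))
```

In the space-like case the transverse component repeats with period 2*pi in theta, while in the time-like case it dies away and the velocity tends to the constant term.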

The right sides of Eqs. (23), (24), (25), (26), (27), and (28) were integrated to obtain
the average value for $v/c$. The results obtained were not very revealing; however, what
is significant is that one can obtain an average value. This is not possible in the case
of the other electron models.⁸

A very interesting particular motion is obtained for $\cos\alpha = 0$, where $p^4$ and
$U_0^4$ are zero. If we examine Eqs. (27) and (28), we will observe that Eq. (27) is a
periodic motion along the world line, whereas Eq. (28) is aperiodic along the world line.
However, both are aperiodic in ordinary time.

In other theories the equation of motion has only one variable, namely $u_j$, whereas in
Wessel’s theory two variables, $u_j$ and $p_j$, exist explicitly, as shown by Eqs. (27)
and (28). We also observe that the classical motion is not constant; instead it has a
behavior very similar to the “zitterbewegung” first described by Schrödinger¹³ for the
behavior of wave packets in the force-free field. Thus, quite similarly, we have the
constant term which would correspond


13 E. Schrödinger, Berl. Ber. Nr. 11/12 S.105 (1932).




to classical motion and the variable term which corresponds to the “zitterbewegung” type
motion. However, except for general similarity of form there do exist significant
differences. For example our frequency, $\Gamma/\lambda = 3m_0c^2/2e^2$, which also
represents the reciprocal length referred to the world line, differs from Schrödinger’s,
$\nu = 2mc^2/h$, by a factor 3(137). The question still to be answered is the
corresponding meaning of $\Gamma/\lambda$ to that of Schrödinger’s frequency. The
physical meaning remains unanswered. The important point, which should not be overlooked,
is the fact that by this method one does obtain a relatively simple and reasonable
solution to the electron problem, whereas if the methods explained by Eliezer⁸ are used,
complicated and unreasonable solutions are obtained.

Since in the force-free case we do not get uniform motion, the angular momentum is not
constant. Wessel¹¹ showed that one can introduce an intrinsic spin so that the variation
of the angular momentum is just compensated and the total angular momentum is again
constant. The fact that the angular momentum is disturbed by the radiation reaction force
is to be expected. The remarkable feature is not the fact that the spin just compensates
for this variation, but that the introduced spin obeys reasonable equations of motion.

Still to be determined is the choice of the variable mass equations, namely, Eqs. (11) or
(12), which specify the space-like and time-like $p_j$, respectively. To resolve this let
us compare Eqs. (13) and (14) with Eqs. (6.20) and (6.23) of a previous paper.¹²

Let $c_j = a_j$ and $p_j/m_0c = -U_j$ in Eq. (13); then we can write

$$u_j = \frac{a_j}{\cos(\Gamma s/\lambda)} + \frac{U_j \sin(\Gamma s/\lambda)}{\cos(\Gamma s/\lambda)}. \tag{29}$$

If we square Eq. (29), we find that $u_j u^j = -1$ is satisfied.

Similarly, let $c_j = U_j$ and $p_j/m_0c = -a_j$ in Eq. (14); then we get

$$u_j = \frac{U_j}{\sinh(\Gamma s/\lambda)} + \frac{a_j \cosh(\Gamma s/\lambda)}{\sinh(\Gamma s/\lambda)}, \tag{30}$$

and if it is squared we satisfy the condition $u_j u^j = -1$.







Comparing Eqs. (29) and (30) with Eqs. (6.22) and (6.23) termwise,¹² we find that

$$1/\cos(\Gamma s/\lambda) = \sinh\Gamma\Lambda \quad \text{and} \quad \tan(\Gamma s/\lambda) = \cosh\Gamma K \tag{31}$$

for $p_j$ space-like. For $p_j$ time-like we have

$$1/\sinh(\Gamma s/\lambda) = \sinh\Gamma\Lambda \quad \text{and} \quad \operatorname{ctnh}(\Gamma s/\lambda) = \cosh\Gamma K. \tag{32}$$

The equation for the variable mass, when $p_j$ is time-like, is
$m = -m_0 \operatorname{ctnh}(\Gamma s/\lambda)$, so that from Eq. (30) we can relate it
to the $K$ writing, i.e.,

$$m = -m_0 \operatorname{ctnh}(\Gamma s/\lambda) = m_0 \cosh\Gamma K. \tag{33}$$

Similarly, when we consider the $p_j$ as space-like, we get

$$m = m_0 \tan(\Gamma s/\lambda) = m_0 \sinh\Gamma\Lambda. \tag{34}$$

The question as to which of the above two equations is the more desirable one has not yet
been resolved. Wessel¹⁴ prefers Eq. (34) because it leads to the periodic motion in
Eq. (29) and also seems to have a more reasonable eigenvalue spectrum.

The important difference to observe regarding the mass is that in the models discussed by
Eliezer,⁸ the mass goes to infinity exponentially, whereas in our case it goes to
infinity trigonometrically. Thus, the rate at which the mass approaches infinity is
considerably reduced. The fact that the mass approaches infinity is not particularly
disconcerting because the physical meaning is connected with the mass operator and its
eigenvalues.

ACKNOWLEDGMENTS

The author is greatly indebted to Dr. Walter Wessel for suggesting this investigation and
for his many helpful criticisms and stimulating discussions. The author also wishes to
express his appreciation to Colonel L. B. Williams, Chief, Aeronautical Research
Laboratory, Wright Air Development Center, and Dr. J. E. Clemens, Chief, Physics Research
Branch, for their encouragement during the entire period required for this study.

14 W. Wessel, private communication.


Professor O. H. Blackwood 


The American Association of Physics Teachers notes with regret the death of Dr. O. H. 
Blackwood, professor of physics at the University of Pittsburgh, on March 21, 1953. Dr. 
Blackwood labored long and well to improve the teaching of physics and to stimulate interest 
in physics. He felt that the study of physics is an important part of a liberal education, and 
many of his activities were directed toward encouraging nonscience majors to study physics 
in high school or college. Dr. Blackwood was author or coauthor of four textbooks. A frequent 
participant in the meetings of the AAPT, he contributed a fresh viewpoint to many of the 
perennial problems of teaching. Dr. Blackwood served as member of the Council of the AAPT 
and as president of the Western Pennsylvania Section.—W. C. KELLY. 


Professor George Alfred Stinchcomb 


George Alfred Stinchcomb, professor of physics at Heidelberg College since 1919, died at 
his home in Tiffin, Ohio, on November 17, 1953. Dr. Stinchcomb was a graduate of Heidelberg 
College in the class of 1915. He held the Master of Arts degree from the University of Michigan 
(1922) and the Ph.D. degree from Ohio State University (1939). Dr. Stinchcomb was a veteran 
of World War I and was wounded while serving in France as lieutenant. Dr. Stinchcomb 
was very active among Ohio physics teachers and was a member of several scientific societies. 
A memorial scholarship is being established by his friends. 





NOTES AND DISCUSSION


A Note on the Utilization of Radiant Energy 
from the Sun in Rockets 


WILLIAM M. Conn 
6312 Indiana Avenue, Kansas City 30, Missouri 


PRINCIPAL limiting factors in the design of rockets
are exhaust velocity, mass ratio (rocket plus fuel to
rocket without fuel), and maximum initial thrust. The use
of atomic energy would greatly improve the mass ratio, but 
problems of excessive temperature, difficulty of cooling, 
and radiation shielding would arise.¹ The purpose of this
note is to point out another method of providing energy 
for rockets which is intermediate between traditional and 
atomic energy. It is the utilization of radiant energy from 
the sun collected during flight of the rocket. Solar energy 
would naturally serve first as an auxiliary and only later 
as the principal source of energy. 

New methods have been developed in recent years in 
which solar energy has been utilized to heat materials to 
very high temperatures (up to 7000° F) and to control the 
temperature accurately.²,³

The design of a rocket powered by solar energy might 
be visualized as follows: the collecting mirror would be 
covered by a transparent hood shaped according to the 
streamlined exterior of the rocket. The mirror would be 
of paraboloidal or spherical shape, the former giving best 
results if pointed towards the sun. The spherical mirror 
would not have to be pointed to the sun if designed as a 
partly coated transparent sphere. The paraboloidal mirror 
would permit collection of incident energy over a wide field 
of view; the spherical mirror would produce a well-defined 
image for only a few degrees. The collected solar energy 
would serve for two tasks, (1) it would heat a working 
fluid, thus replacing fuel used to supply various mecha- 
nisms with energy and avoiding expensive transportation 
of fuel from the earth’s surface; (2) it would be used for 
the propulsion of the rocket. 

A solid propellant with a very high ignition temperature 
would replace the conventional solid propellant burning 
at a comparatively low temperature in the combustion 
chamber and the liquid propellant heated by chemical 
reaction. An image of the sun smaller in diameter than 
the diameter of the combustion chamber would be 
projected through a window of fused silica into the cham- 
ber. The entire chamber and the nozzle would be filled 
with high-temperature propellant leaving only a narrow 
central bore along the axis. The part of the propellant on 
which the solar image is formed would be heated by the 
incident radiation. Combustion or evaporation would 
start immediately from the solar image and the region 
adjacent to it, and the reaction products would be expelled 
through the nozzle. Used propellant would be replaced by 
mechanically feeding rods of solid propellant at a con- 
trollable rate into the image of the sun. The propellant 
itself would form the “combustion chamber.” Very high
temperatures could be used because the outer metallic 




walls of the chamber would be insulated from the high 
temperatures attained in the area of the solar image by 
the layer of propellant which remains outside of the image. 

Advantages of the utilization of solar energy in rockets 
over present propellants and over atomic energy would 
include, (1) decrease of the amount of fuel carried; (2) 
close control of the amount of propellant used; (3) attain- 
ment of higher temperatures in the target area than now 
practical; (4) simplification of conventional rocket equip- 
ment while the added weight due to the collecting appa- 
ratus would be comparatively low; and (5) avoidance of 
radiation shielding. 

No streamlining of the collector for solar energy would 
be required in the case of rockets or satellite stations in- 
tended to remain in space for long periods of time. It 
would be possible to use relatively large ‘outrigger’ 
mirrors which would not need to be of heavy construction 
if only small accelerations were contemplated. The collect- 
ing mirrors might be of rather large size produced in a 
single piece⁴ or they might be built up from many small
mirrors.⁵

It is realized that a considerable amount of experimental 
and theoretical research and development work will be 
required for solving many of the problems indicated, but 
it is felt that the solution of these problems seems more 
feasible now that sufficient groundwork has been laid in the 


control of high temperatures produced by solar energy. 
As a first step a rocket would be designed in which solar 
energy would serve as a supplementary source of energy. 
A comparatively small and compact collector would 
suffice for this purpose. A later task would be the design 
of rockets and other devices in which solar energy would 
serve as the only source of energy. 


1 Seifert, Mills, and Summerfield, Am. J. Phys. 15, 1-21, 121-140, 255-272 (1947).
2 W. M. Conn, Rev. Sci. Instr. 22, 945-951 (1951).
3 W. M. Conn, Bull. Am. Ceram. Soc. 33, 69-72 (1954).
4 W. M. Conn, “Principal points to be considered in the design of mirror-equipped solar
furnaces for research and development work at high temperatures.” (To be published.)
5 F. Trombe, Compt. rend. 235, 704-705, 1211-1213 (1952).


Large-Screen Oscilloscope 


EUGENE T. PATRONIS, JR. 
Georgia Institute of Technology, Atlanta, Georgia 

THE advent of the mass-produced cathode-ray tube
as used in commercial television has made it possible 
to construct economically a large-screen oscilloscope for 
lecture room demonstration purposes. Besides affording a 
large image, this oscilloscope produces an intense trace 
which may easily be seen in lecture halls even under adverse 
light conditions. In lecture demonstrations, the frequencies 
of interest are generally power and audio-frequencies. By 
restricting the instrument to operation in this range, the 
problems involved in magnetic deflection of the electron 

beam are somewhat simplified. 




FIG. 1. Low-voltage and high-voltage power supply. T2 supplies 400-0-400 v rms at 250 ma, 6.3 v at 10 amp, and
6.3 v at 0.6 amp. The 500-k potentiometer is adjusted for 300-v output from the regulated supply. L1 is a 4-henry 
choke of low resistance capable of carrying 250-ma. T3 is an rf high-voltage transformer which is commercially avail- 
able. The value of CT depends on the transformer used but is in the range 500-1000 micromicrofarads. The cathode-ray 
tube is the rectangular type 21YP4. The filter capacitors in the low voltage supply are 8 mfd, 600-volt units. 


FIG. 2. Vertical and horizontal amplifiers and linear time base generator. T1 is a Linear Standard Unit UTC-LS-61. The
secondary impedances available will match any deflection coil. The sweep range may be extended by lowering the capaci- 
tance in the plate circuit of the 884. 






The components of this oscilloscope are a low-voltage 
power supply, high-voltage power supply, vertical and 
horizontal amplifiers, linear time base generator, deflection 
yoke, and cathode-ray tube. Suggested circuits for these 
components are illustrated in Figs. 1 and 2. Other circuits 
may perform just as well; however, there are some design 
features inherent in the recommended circuits which are 
worth mentioning. The requirements of the horizontal 
amplifier are quite stringent. It must be capable of ampli- 
fying a sawtooth wave form of thirty cycles per second 
or lower without introducing an appreciable amount of 
distortion. In order to accomplish this, the transient re- 
sponse of the amplifier must be excellent. The critical 
element of this amplifier is the output transformer used 
to couple the power amplifier stage to the horizontal deflec- 
tion coils. The output transformer must have high primary 
inductance, low winding resistance, low leakage reactance, 
low core losses, and be incapable of saturation when the 
stage is operating at peak power. After much experimentation,
the author has found that the Linear Standard Series of
the United Transformer Company is the best choice.
Inferior units will not accomplish the task. The deflection 
yoke is a commercial unit employing the usual vertical 
and horizontal deflection coils. The necessity for magnetic 
deflection on large cathode-ray tubes limits the upper fre- 
quency at which the oscilloscope will operate successfully. 
When the fields of the deflection coils collapse, a large 
transient oscillation is produced in the resonant circuit 
formed by the coils and their distributed capacity. The 
recommended circuits have provisions for eliminating 
these undesirable oscillations. 

The scope is capable of displaying up to three cycles of 
60-cycle ac. A lower sweep rate will result in objectionable 
flicker as the cathode-ray tube is of the medium persistence 
type. A unit which is carefully constructed along the lines 
mentioned will perform satisfactorily for demonstration 
purposes throughout the entire audio-range. 




LETTERS TO THE EDITOR 


Multiple Images in Plane Mirrors 


DEMONSTRATING multiple images in plane mirrors
at some angle with each other is usually done by
placing some single object between the mirrors and pointing 
out that the number of images increases with decrease in 
angle. A blackboard demonstration shows which image is 
formed from which image, but students are usually 
troubled in the demonstration. It takes a little figuring to 
see which gives rise to which, especially if the original 
object possesses symmetry. This dilemma is alleviated by 
the following elementary device. 

Use an incandescent lamp as the object. Fasten to the 
lamp with Scotch tape several strips of colored Cellophane 
the like of which is found on lollypops. The successive 
images are now clearly identified. Moreover, the pattern


observed has kaleidoscopic properties and no small esthetic 
value. 

JULIUS SUMNER MILLER
El Camino College 

El Camino College, California 


Peculiar Properties of a Glass Rod 


TWO of my colleagues in chemistry¹ have presented me
with an optical problem which, they say, is out of 
their province. I communicate it to our readers with the 
feeling that they will find the problem interesting if not, 
indeed, intriguing. 
Secure from the chemistry laboratory a solid glass 
stirring rod. These are some 6 or 7 mm in diameter and 
8 or 9 inches long. Now letter carefully in red pencil the 


words CHOICE OXIDE; in blue pencil letter the words GLASS
MATERIAL. Hold the rod horizontally above the words, 
some 3 or 4 mm above, so that the red and blue letters are 
read by transmission. The very peculiar phenomenon will 
now be evident. The red letters are unaltered but the blue 
ones are inverted! 

I shall be glad to receive formal discussion of the prob- 
lem. Another note is in preparation. 
JULIUS SUMNER MILLER


El Camino College 
El Camino College, California 


1 Professor J. C. Hileman and Professor William Ruebsamen. 


An Electrical Method for Solving 
the Lens Equation 


WHILE perusing a few back issues of the Journal, I
came across Dr. Purbrick’s excellent letter¹ and
it occurred to me that a projection of this device another
step would offer interest to both the student and applied
physicist.

Dr. Purbrick has shown that with three potentiometers
connected in parallel, the currents can be made proportional
to u, v, and f to satisfy the well-known lens equation:

1/u + 1/v = 1/f.


With suitable servomechanisms, these currents can be 
used to vary the lens to image, and object to lens distances 
such that focus is always maintained. This method allows 
the center potentiometer to be replaced by a fixed position. 
Furthermore, a scale to indicate magnification may easily 
be fixed to this assembly. 
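The relation that such a servomechanism would have to maintain can be sketched numerically. This is only an illustration of the lens equation, not of the potentiometer circuit itself: the function names `image_distance` and `magnification` are hypothetical, and the sign convention for the magnification (negative for an inverted real image) is an assumption, since the letter does not state one.

```python
def image_distance(u, f):
    """Solve 1/u + 1/v = 1/f for the image distance v.

    u is the object-to-lens distance and f the focal length, in the same units.
    """
    if u == f:
        raise ValueError("object at the focal point: image is at infinity")
    return u * f / (u - f)

def magnification(u, v):
    """Lateral magnification, taken here as -v/u (assumed sign convention)."""
    return -v / u

u = 30.0   # object distance, cm
f = 10.0   # focal length, cm
v = image_distance(u, f)
print(v, magnification(u, v))
```

As the object distance u is varied, recomputing v in this way gives the lens-to-image distance that keeps the image in focus, which is the quantity the servo would track.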






Such an electrical apparatus will present some advantages
over proposed mechanical systems.² Further,
this presentation might benefit the student who feels a
need to see basic laws in application.


DONALD J. LOVELL
Industrial Research Laboratories 


Hilltop and Frederick Roads 
Baltimore 28, Maryland 


1 R. L. Purbrick, Am. J. Phys. 20, 376 (1952).

2 G. A. Fry, “A new method of producing continuously variable
magnification with an optical system,” Wright Air Development Center,
Technical Report 520279, October, 1952.


On the Linearization of a Relativistic 
Hamiltonian 


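HENRI Mitler¹ is correct in suspecting that the deduc-
tion of Eq. (8) from Eq. (7) in his note of the above
title is erroneous. It is true that

$$A^2 - q_4^2 = q_\nu q_\nu, \qquad (1)$$

$$(\alpha_1 A + \alpha_2 i q_4)(\alpha_1 A + \alpha_2 i q_4) = A^2 - q_4^2, \qquad (2)$$

and

$$\Big(\sum_\nu \alpha_\nu q_\nu\Big)\Big(\sum_\mu \alpha_\mu q_\mu\Big) = q_\nu q_\nu \qquad (\nu, \mu = 1, 2, 3). \qquad (3)$$

However it would not be valid to deduce from these equa-
tions that

$$\alpha_1 A + \alpha_2 i q_4 = \sum_\nu \alpha_\nu q_\nu. \qquad (4)$$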


This is because the relation $G = H$ does not necessarily
follow from $G^2 = H^2$ when G and H are matrices. An example
of this may be seen from the fact that the square of each
one of Pauli’s three distinct spin matrices is the unit matrix.
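
The point about the Pauli matrices can be checked directly. The following is a small sketch in plain Python, with 2x2 matrices held as nested lists; the helper names `matmul` and `negate` are illustrative assumptions.

```python
def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def negate(a):
    """Return the matrix -a."""
    return [[-x for x in row] for row in a]


IDENTITY = [[1, 0], [0, 1]]
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]

# Each spin matrix squares to the unit matrix ...
for name, sigma in (("sigma_x", SIGMA_X), ("sigma_y", SIGMA_Y), ("sigma_z", SIGMA_Z)):
    print(name, "squared equals the unit matrix:", matmul(sigma, sigma) == IDENTITY)

# ... yet the matrices themselves are neither equal nor opposite.
print("sigma_x equals +sigma_y or -sigma_y:",
      SIGMA_X == SIGMA_Y or SIGMA_X == negate(SIGMA_Y))
```

Equal squares therefore fix a matrix neither uniquely nor up to sign, which is why Eq. (4) cannot be read off from Eqs. (1) to (3).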

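As a matter of fact, the conclusion which can be drawn
from Eqs. (1), (2), and (3) is that

$$(\alpha_1 A + \alpha_2 i q_4) = (\alpha\cdot q)(\alpha\cdot q)(\alpha_1 A + \alpha_2 i q_4)^{-1} = (\alpha_1 A + \alpha_2 i q_4)^{-1}(\alpha\cdot q)(\alpha\cdot q). \qquad (5)$$

If the right-hand side of Eq. (5) is equated to $\sum_\nu \alpha_\nu q_\nu$, the
relations $q_3 = 0$, $q_2 = i q_4$ leading to the trivial result $A = q_1$
are obtained.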


S. Y. TIWARI AND B. BHATTACHARYA
University of Rangoon,
Burma 


1 Henri Mitler, Am. J. Phys. 21, 473 (1953). 


Changing Vectors 


IN our year course in physical science, which is given for
nonscience majors, we approach the subject of me- 
chanics from the astronomical-historical viewpoint. For 
instance, we show in a semiquantitative way how Newton 
arrives at the law of universal gravitation from his own 
three laws and Kepler’s three empirical rules about the 
motion of the planets. Consequently, it is necessary for us 
to show early in the course that a body following a circular 
path with constant speed has an acceleration, and many 
students are shocked by the idea that a vector—a concept 
to which they have just lately been introduced—changes 
when its direction changes. (One girl, however, got the 
idea so quickly that she remarked at the conclusion of the 
usual derivation that she was surprised that a body with 
constant speed could have a changing acceleration!) 




In order to clarify the students’ thinking on the subject 
of the changes of vectors, the following instructional 
pattern is useful. First, the case of a falling body is dis-
cussed; here the change in velocity per unit time is in the
same direction as the velocity. Next, the case of a body 
moving about the rim of a circle with constant speed is 
rediscussed; here the acceleration vector is always per- 
pendicular to the velocity vector. Finally, the case of ellip- 
tical motion in a central force field is compared with the 
first two cases. The acceleration vector points toward one 
focus of the ellipse, as Newton was able to show. A simple 
diagram should indicate that this vector can be divided 
into two perpendicular components: (1) along the line of 
the velocity vector, and (2) at right angles to the velocity 
vector. The vector (1) causes the speed to change in 
accordance with the law of areas; (2) appears to point 
toward the center of the (instantaneous) circle. The 
velocity vector of the “planet” is seen to change in both
magnitude and direction. 
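
The third case can be made quantitative with a short sketch. It assumes an inverse-square central force with the force center at one focus (taken as the origin), and arbitrary units in which the strength constant `mu`, the semimajor axis `a`, and the eccentricity `e` are illustrative choices. The function names are likewise assumptions.

```python
import math


def kepler_state(theta, a=1.0, e=0.5, mu=1.0):
    """Position, velocity, and acceleration at angle theta from perihelion."""
    p = a * (1.0 - e * e)                  # semi-latus rectum
    r = p / (1.0 + e * math.cos(theta))    # distance from the focus
    h = math.sqrt(mu * p)                  # angular momentum per unit mass
    pos = (r * math.cos(theta), r * math.sin(theta))
    vel = (-mu / h * math.sin(theta), mu / h * (e + math.cos(theta)))
    acc = (-mu * pos[0] / r ** 3, -mu * pos[1] / r ** 3)   # points at the focus
    return pos, vel, acc


def split_acceleration(vel, acc):
    """Components of acc along the velocity and at right angles to it."""
    speed = math.hypot(vel[0], vel[1])
    along = (acc[0] * vel[0] + acc[1] * vel[1]) / speed    # changes the speed
    across = (vel[0] * acc[1] - vel[1] * acc[0]) / speed   # turns the velocity
    return along, across


for degrees in (0, 90, 180, 270):
    _, vel, acc = kepler_state(math.radians(degrees))
    print(degrees, split_acceleration(vel, acc))
```

At the nearest and farthest points the component along the velocity vanishes, as in circular motion; on the outward half of the orbit it is negative and the planet slows, while on the inward half it is positive.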

IRVING L. KOFSKY AND KENNETH W. SHERK
Physics Department 
Chemistry Department 


Smith College, 
Northampton, Massachusetts 


The Undergraduate Physics Laboratory 


A CRITICISM which is often leveled at many under-
graduate laboratory experiments in physics is that
they have little or no connection with the type of problem 
with which a practicing physicist has to deal. This criticism 
is, I feel, frequently valid. Too often a student is given an 
instruction sheet which tells him not only what phe- 
nomenon he is to investigate, but the theory behind it, 
what measurements to make, and how to make them. 
Then the student is prodded, either by his written instruc- 
tions or by his instructor, to the one inescapable conclusion 
the experiment was designed to demonstrate. Has this 
student been helped in any way to pursue an investigation 
in a world devoid of instruction sheets and instructors? 

Perhaps this method of laboratory instruction can be 
defended on the basis that exigencies of time and space 
preclude giving laboratory problems to undergraduates 
which require much thought as to the best experimental 
methods to employ, or what measurements are required, 
to shed light on the phenomenon in question. Sometimes 
the performing of a “cookbook” experiment may impress an
important physical fact on the consciousness of a student 
while acquainting him with apparatus and techniques 
necessary to his training. This is probably true, but such 
a pedagogical procedure most certainly does not require 
the kind of thinking which is essential to the successful 
application of the scientific method. 

The following experiment, which has been in use at 
Georgia Institute of Technology in the undergraduate 
optics laboratory for two years, was designed to meet, in 
part, the criticisms just offered. 

The student is shown a student spectrometer (which he 
has learned to manipulate in a previous experiment) on 







the table of which is mounted a single slit. The slit of the 
collimator is illuminated with light from a sodium vapor 
lamp and the slit on the table is set by eye so that its plane 
is normal to the axis of the collimator. The student is shown 
the single-slit diffraction pattern in the focal plane of the 
telescope objective and is asked to learn all he can, of a 
quantitative nature, about single-slit diffraction. Addi- 
tional equipment consists of three other single slits of 
different widths, a mercury lamp with filters to isolate the
5461A and 4359A lines, and a traveling microscope. Some- 
times the additional equipment is supplied at the outset, 
sometimes only when and if requested. The experiment 
is given before diffraction is discussed in class and the 
students are requested to proceed as far as possible on 
their own without consulting references. On the average, 
the performance of the experiment requires about three 
hours of laboratory time plus the time required for the 
analysis of the data and the preparation of a report. 

Many students go very quickly to the heart of the 
problem. They investigate the angular positions of the in- 
tensity maxima or minima or both, investigate quanti- 
tatively the effect of the width of the slit (measured with 
the traveling microscope), investigate the effect of changing 
wavelengths, and analyze the data, sometimes graphically, 
to deduce an empirical law equivalent to 


$$\theta_1 = k\lambda/a,$$

where $\theta_1$ is the angular width of the central maximum, a
is the width of the slit, and k a constant depending on the 
units. They will usually find that all of the maxima are of 
the same width except the central one which is twice the 
width of the others. Occasionally they investigate the effect 
of varying the width of the source slit. I have not been 
requiring a theoretical analysis of the problem, but theo- 
retical arguments, of varying degrees of plausibility, have 
frequently been forthcoming. One student, after having 
made a thorough analysis of his data, was able to show 
that the incident light was not normal to the plane of the 
slit, and to deduce by how much it departed from 
perpendicularity. 
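
For comparison with what the students find empirically, here is a brief sketch based on the standard Fraunhofer condition for the minima of a single slit at normal incidence, sin θ = mλ/a. That condition is textbook theory rather than something stated in this letter, and the slit width of 0.2 mm and the function names are illustrative assumptions.

```python
import math


def minima_angles(wavelength, slit_width, orders):
    """Angles (radians) of the first `orders` minima on one side of center."""
    return [math.asin(m * wavelength / slit_width) for m in range(1, orders + 1)]


def maxima_widths(wavelength, slit_width, orders=4):
    """Angular width of the central maximum and of the side maxima."""
    minima = minima_angles(wavelength, slit_width, orders)
    central = 2.0 * minima[0]              # from first minimum to first minimum
    side = [minima[i + 1] - minima[i] for i in range(len(minima) - 1)]
    return central, side


slit_width = 2.0e-4                         # metres (assumed 0.2-mm slit)
for label, wavelength in (("sodium", 5893e-10), ("Hg 5461 A", 5461e-10)):
    central, side = maxima_widths(wavelength, slit_width)
    print(label, central, side)
```

For small angles the central width reduces to 2λ/a in radians, so the constant k is 2 when angles are in radians and λ and a share a unit, and each side maximum has half that width.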

Of course there are students in most classes who make 
a very poor attempt at an experiment of this kind. On the 
whole, however, the results have been most encouraging. 
The students who successfully deduce a law which fits 
all of their data seem to experience a healthy feeling of 
accomplishment. In fact they have frequently asked for 
more experiments like this to replace those of the usual 
cut-and-dried variety. I would certainly like to meet the 
demand, and will appreciate any suggestions from readers. 

One of the aims of physics teaching is to afford training 
in the methods and thought processes of scientific research. 
I believe that this goal can best be achieved through wise
use of the student’s laboratory time.


VERNON CRAWFORD 
Georgia Institute of Technology 
Atlanta, Georgia




RECENT MEETINGS 


Minnesota Area Section 


The annual fall meeting of the Minnesota Area Associa- 
tion of Physics Teachers was held at the Duluth Branch 
of the University of Minnesota on October 17, 1953. 
President R. B. HASTINGS presided. At the business
meeting Professor PETER FOSSUM was elected Vice Presi-
dent to fill the vacancy created by the resignation of
Professor W. WYMAN. After an address of welcome from 
Provost R. W. DARLAND the following contributed papers 
were presented. The address by Dr. O. H. SMITH was by 
special invitation of the executive committee. 


1. Some problems in connection with the physics course 
for nonscience majors. HAROLD HUGHESDON AND GEORGE
STUBBS, College of St. Thomas.


2. A Fresnel diffraction experiment for the advanced 
laboratory. BROTHER H. GERARD, St. Mary's College.— 
A 931A photomultiplier tube and galvanometer are used 
to measure the relative intensities of a Fresnel diffraction 
pattern produced by a straight edge. The photomultiplier 
tube is completely shielded except for a vertical 0.05-mm 
slit and is moved on accurate ways across the diffraction 
pattern by a micrometer screw. The galvanometer readings 
are plotted against the displacement of the tube. Relative
intensities for the pattern are calculated from Cornu’s
spiral and are matched to the experimental intensities. The 
theoretical and experimental intensities are found to be 
in good agreement out to about the fifth maximum of the 
diffraction pattern. 
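
The theoretical curve referred to in this abstract can be sketched numerically. The code below assumes the standard straight-edge result, in which the intensity relative to the unobstructed beam is ½[(C(v)+½)² + (S(v)+½)²], with C and S the Fresnel integrals that trace out Cornu's spiral and v the dimensionless Fresnel parameter (positive on the illuminated side). The function names and the use of Simpson's rule are illustrative choices, and the conversion of v to micrometer-screw displacement depends on the geometry and is not attempted.

```python
import math


def fresnel_cs(v, steps=2000):
    """Fresnel integrals C(v) and S(v) by Simpson's rule (steps must be even)."""
    if v == 0.0:
        return 0.0, 0.0
    h = v / steps
    c = s = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        phase = 0.5 * math.pi * t * t
        c += weight * math.cos(phase)
        s += weight * math.sin(phase)
    return c * h / 3.0, s * h / 3.0


def straight_edge_intensity(v):
    """Intensity behind a straight edge, relative to the unobstructed beam."""
    c, s = fresnel_cs(v)
    return 0.5 * ((c + 0.5) ** 2 + (s + 0.5) ** 2)


# Tabulate the pattern from the geometrical shadow into the bright region.
for k in range(-4, 9):
    v = 0.5 * k
    print(v, round(straight_edge_intensity(v), 4))
```

At the geometrical edge (v = 0) the relative intensity is one quarter, and the fringes on the illuminated side oscillate about unity with decreasing amplitude.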


3. Institutes for teachers of physics. J. W. BUCHTA,
University of Minnesota.—Many in colleges are very free 
in finding fault with science teaching in the high schools, 
and the preparation of the teachers who in many cases 
come from teacher training institutions. But not often do 
we in the colleges, or the universities especially, do more 
than complain about the situation. No doubt much of the
complaint is not justified; nevertheless, we can undoubtedly
improve the situation. To aid in improving the situation, 
a desire shared by both high school and college teachers, 
the Department of Physics at the University of Minnesota 
operated an institute for high school teachers during the 
past summer. It was sponsored by the Fund for the 
Advancement of Education. Its objectives—which briefly 
were only to aid the high school teacher in his subject 
matter field, and to make that aid such that it would be of 
practical value to him on the level at which he finds him- 
self—and some observations on the institute were described. 

A parallel institute sponsored by the National Science 







Foundation was conducted for college teachers of physics, 
especially those from the smaller liberal arts colleges, 
teachers colleges, and smaller universities—those depart- 
ments in which there are usually not more than two men. 
Teachers in these institutions find it difficult to keep 
abreast of the research going on in the field. It is quite 
impossible for them to carry on research in the same 
manner that it is carried on in the larger universities and 
research centers. It is believed that a series of lectures and 
visits to research projects underway in the university 
would enhance the teaching of those who attended such 
an institute, aid them in reading the literature, and enable 
them to be more effective in encouraging qualified students 
to go on to graduate work. The lectures that were given, 
the manner of operating the institute, and some of the 
reactions of those who participated were described. 


4. Some experiments on ion-exchange membranes. 
R. B. HASTINGS, Macalester College—For a long time the
purification of water has been accomplished by the use of 
filtration beds or by filtration towers in which a granular 
resinous material is used. As the water seeps through the 
filtering units, a process known as ion exchange takes place 
and a large percentage of the impurities held in solution are 
removed. This method is effective but very slow. Also the 
resinous material eventually becomes saturated and re- 
quires frequent treatment by sulphuric acid to restore it. 
This is time-consuming and expensive. 

Some of these difficulties have been overcome by in- 
corporating the resinous material into the form of a mem- 
brane produced in sheets rather than in the conventional 
beds or granules. While the sheet and granular forms have 
many common properties, there are notable differences. 
Membranes for example are used in continuous operations 
whereas batch operation is customary with granular resins. 




Membrane processes are accomplished by the expenditure 
of electrical energy, rather than by the use of chemical 
regenerants. 

The most important property of ionic membranes is 
their permeability to ions of one charge, and imperme- 
ability to ions of opposite charge. Thus, electrical current 
is carried through the membranes by ions of one charge 
only, whereas both positively and negatively charged ions 
carry current in solutions. The membranes function, there- 
fore, as a kind of ionic sieve which can be used for sorting 
positively and negatively charged particles in solution. 

Slides were used showing the following: the ion-exchange 
cell, the cell functioning in an experiment, the graphs 
showing de-ionization, and microphotographs showing the 
structure of some of the membranes made. 


5. A survey of some recent ideas about magnetism. 
O. H. SMITH, Macalester College and Hamline University.—
The fundamental ideas involved in the Ewing Model for 
ferromagnetism have been declared destined to survive in 
main outline as our knowledge of magnetism advances. 
These elements have become the domains of Bozorth, 
Williams, Kittel, and others. They can account in some 
measure for the phenomena of remanence, coercivity and
hysteresis as the Ewing elements did and in addition they 
can give a good account of the Barkhausen effect. There 
are probably two things that contribute to this effect. The 
first is the snapping of a moving Bloch wall past a defect in 
the crystal and the second is the sudden turning of a small 
domain remaining when the relationship between the 
energy of the moving wall and that required to turn the 
remaining part of an unaligned domain reaches a critical 
point. 

KENT H. BRACEWELL 
Secretary 




ANNOUNCEMENTS AND NEWS 


Book Reviews 


Information Theory and Its Engineering Applications. 
D. A. BELL. Pp. 135+viii, Figs. 30, 14×23 cm. Pitman
Publishing Corporation, New York, 1953. Price
$4.00. 


Information theory is a glamorous field of engineering 
or applied mathematics which has physical and philosoph- 
ical overtones. It includes some undisputed work which 
is useful in communications engineering. This work has 
cast new light on and unified many problems, and it has 
suggested a perhaps disappointingly few new practical sys- 
tems or devices. It also leads directly to the formulation of a 
host of problems, largely concerning nonlinear systems, 
which are extremely difficult to deal with. 

Beyond all this, information theory appears to be re-
lated to physics through entropy, because mathematically
like quantities called entropy appear in both. This very 
difficult and imperfectly understood aspect of informa- 
tion theory greatly interests the present author. Thus, he 


introduces entropy through a discussion of thermody- 
namics and statistical mechanics, and he requires the 
communication engineer to understand these before he is 
allowed to proceed. 

The author deals with a large number of topics beyond 
certain essential minimum considerations of band width, 
signaling speed, the effect of signal-to-noise ratio, and 
coding. The section on practical applications cites many 
communication systems or proposals very briefly. There are 
also a chapter on optimum linear filtering and five various 
appendixes. 

I think that it would be unwise for the otherwise unin- 
itiated reader to expect to get a clear understanding of 
any part of information theory from this book, and 
certainly he could not get it easily. For a person with some 
background, the book could serve as a sort of intellectual 
and moral try-your-strength, with which he might proceed 
warily, continually asking himself, have I been misled? 

J. R. PIERCE
Bell Telephone Laboratories 







Statistical Thermodynamics. Second edition. ERWIN
SCHRÖDINGER. Pp. 95, Figs. 5. Cambridge University
Press, New York, 1952. Price $1.75. 


In 1944 Professor Schrödinger gave a course of seminar
lectures on statistical thermodynamics at the Dublin Insti- 
tute for Advanced Studies. One of the fortunate outcomes 
of these lectures was the publication of his little book on 
statistical thermodynamics by the Cambridge University 
Press. In this book the author, to quote his own words, 
attempts “to develop briefly one simple unified standard
method capable of dealing, without changing the funda-
mental attitude, with all cases (classical, quantum, Bose-
Einstein, Fermi-Dirac, etc.) and with every new problem
that may arise.” That the attempt is so successful and that
it is achieved in such short space—less than 100 pages— 
is a tribute to the author’s imagination and genius. 

However, it is only fair to warn the beginners in statis- 
tical mechanics that this is hardly the book for them until 
they have completed their basic training elsewhere. The 
treatment of many of the usual topics in statistical thermo- 
dynamics is severely condensed in order to make room for a 
critical treatment of the more subtle and difficult aspects 
of the subject. 

The first half of the book deals with the Gibbs en- 
semble of N identical macroscopic systems. From the very 
outset quantum-mechanical systems are envisaged, their 
states described by state functions, and their energies by 
corresponding eigenvalues. Classical systems are treated 
as limiting cases of quantum systems. The essential problem 
to be solved, of course, is the distribution of a fixed energy 
E among the N systems. Because of its extreme simplicity 
the method of most probable distribution is used initially 
to obtain the partition function of a system, and hence, its 
entire thermodynamics. There follows an excellent dis- 
cussion of the Nernst heat theorem. 

The need for a more rigorous method of solving the 
distribution problem is clearly indicated in the author’s 
discussion of fluctuations. As a consequence, the method 
of mean values due to Darwin and Fowler is introduced and 
the distribution problem is solved anew. The power of this 
method becomes quite apparent as does the mathematical 
tool of contour integration by steepest descent. To do 
justice to this section of the book, the reader should have 
some knowledge of the theory of functions of a complex 
variable. The reviewer would even suggest that students 
interested in the method of steepest descent for evaluating 
contour integrals would do well to consult such books as 
Watson’s Bessel Function or Copson’s Introduction to the 
Theory of Functions of a Complex Variable. 

The second half of the book is largely devoted to a 
thorough examination of the properties of a particular 
macroscopic system consisting of n identical particles
without structure or interaction—an ideal quantum gas.
The same mathematical tools originally used in obtaining
the partition function for any macroscopic system what-
ever are now used in the n-particle problem for a quite
different purpose, that of evaluating a particular partition 
function. Again the great power of these tools is exhibited. 




The two kinds of statistics, Bose-Einstein and Fermi- 
Dirac, flow out of the two different restrictions on the 
occupation numbers. The argument carries both statistics 
along with the use of a single ± sign. One might well wish
that the author at this point had lengthened the book by 
a few pages in order to show the relation between the re- 
strictions on the occupation numbers and the symmetry 
properties of the state function of the system. 

The practical equivalence of the two statistics at high 
temperatures and low densities (weak degeneration) is 
clearly shown as is also their profound difference at low 
temperatures and high densities (strong degeneration). 
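
The way a single sign carries both statistics, and the way the two merge under weak degeneration, can be illustrated with the standard mean occupation number 1/(eˣ ∓ 1). Here x stands for (ε − μ)/kT; that notation, the function name, and the sample values of x are assumptions made for illustration and are not taken from the book under review.

```python
import math


def mean_occupation(x, sign):
    """Mean occupation number of a level with x = (energy - mu)/kT.

    sign = -1 gives Bose-Einstein (requires x > 0), sign = +1 gives
    Fermi-Dirac, and sign = 0 gives the classical Boltzmann limit.
    """
    return 1.0 / (math.exp(x) + sign)


for x in (0.1, 1.0, 5.0, 10.0):
    bose = mean_occupation(x, -1)
    fermi = mean_occupation(x, +1)
    classical = mean_occupation(x, 0)
    print(x, bose, fermi, classical)
```

For large x, which corresponds to high temperature and low density, all three values agree closely. For small x the Bose-Einstein value grows without limit, while the Fermi-Dirac value never exceeds one.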

The author makes no attempt to treat the more general 
n-particle problem with interaction between the particles. 
To do so would probably create more distraction than 
attraction. 

The problem of radiation with its dilemma of infinite 
zero-point energy is attacked in the last chapter. The 
dilemma is resolved in the manner of Born and Peng by 
having the oscillators in either one of two situations: 
excited or not excited. Whether this is more than a mere 
artifice to eliminate infinite zero-point energy remains 
to be seen. 

The second edition of this book differs from the first 
only in having an appendix attached to it. In this ap- 
pendix the author shows that it is not necessary to assume— 
as he does throughout the text—that a system always finds 
itself in a quantum state of sharply defined energy ; rather, 
there is only a probability of it being in this or that state. 
The complete thermodynamics of the system can be 
determined in terms of the distribution of amplitudes of 
wave functions. No assumptions need be made concerning 
individual energies. 

Only the most critical students will find the material 
in the appendix necessary; the majority will be highly 
satisfied with the assumptions and treatment given in the 
main body of the text. 

C. N. WALL 
University of Minnesota 


Nuclear Theory. ROBERT G. SACHS. Pp. 383+xi. Addison-
Wesley Publishing Company, Inc., Cambridge, 1953.
Price $7.50.


Despite much effort, it cannot be claimed that there
exists a theory of the nucleus; but as the author explains, 
there does exist a widely accepted theoretical approach to 
the subject. This textbook is a critical account of the more 
established aspects of that approach. As such, it is neither 
a source book nor an exhaustive treatment of all that has 
been speculated about nuclear phenomena. It is, rather, a 
very carefully written analysis of those theoretical con- 
siderations which in the judgment of the author constitute 
the ground on which nuclear theory is being built. Here 
is to be found both the optimism of the researcher and the 
conservatism of the teacher. The result is good. 

Half of the book is on the two-body problem. Complex 
nuclei are the concern of the balance. It is the reviewer’s 







opinion that the careful attention given to the two-body 
problem is entirely justified. He remembers well observa- 
tions by Professor Pauli, at the Copenhagen conference of 
1936, to the effect that the very lightest nuclei must be 
understood first! The researches of Wigner, Feenberg, and 
Breit corroborate this point of view. Though great masses 
of data have accumulated since 1936, those which we 
understand are largely related to the two-body problem. 

The understanding which is sought is at the level of 
nucleon-nucleon interactions and the discussion is limited 
to two-body interactions. The easy choice to ignore many- 
body interactions is deceptive, for there is nothing in the 
systematics of complex nuclei to justify the hypothesis, 
which is an impious hope at the best. From the deuteron 
and low-energy two-nucleon scattering are obtained certain 
average characteristics of the two-body interaction. As is 
well known, high-energy phenomena present many un- 
resolved difficulties. Undetermined are the shape and 
exchange character of the interaction, its velocity de- 
pendence, and its charge dependence. The information 
that is obtained from complex nuclei is wholly descriptive. 
It is not proving easy to wrest from an undefined problem 
anything but the most general characteristics! The obvious 
is clarified and little is established concerning the interac- 
tions. To gain more, we probably need a fundamental 
theory; for, as our author says, “it seems unlikely that 
attempts to account for the interactions between nucleons 
will meet with much success until a quantitative account 
can be given of the properties of a single nucleon.” 

After the two introductory chapters there are chapters 
on the ground state of the deuteron, excited states of the 
deuteron, S scattering by bound nuclei, and further con- 
siderations of the two-body problem (neutron-proton, 
proton-proton scattering; photodisintegration; capture of 
neutrons by protons). The experimental facts are clearly 
presented and the values of the scattering lengths, effective 
ranges, and other low-energy parameters are evaluated in 
a manner as to summarize both the results of experiment 
and conclusions of theory. Tables for the parameters of 
the Feshbach-Schwinger solution of the deuteron ground- 
state problem for a Yukawa potential including tensor 
interaction are given in an appendix. A qualitative dis- 
cussion is attempted of high-energy phenomena with little 
reward. 

There is interposed a short chapter on isotopic spin and 
the meson theory of nuclear forces. The generalized Pauli 
principle is introduced, with very little discussion, and 
the familiar identities are obtained for the exchange 
operators. The principle that the nucleon wave function 
must be antisymmetric under the interchange of any pair 
of nucleons stands in need of much more careful scrutiny 
than is here given. The discussion of field theory is very 
casual but not unwelcome as a reminder of a theory of 
fundamental particles yet to be developed. 

The next three chapters are on the structure of complex 
nuclei, electromagnetic interactions of nuclei, and nuclear 
reactions. A final and rather inadequate chapter is con- 
cerned with beta decay. There is a discussion of the nuclear 
three-body systems followed by the essential features of 




Wigner’s supermultiplet theory. The independent particle 
model with both Russell-Saunders coupling and Mayer- 
Jensen coupling is then discussed in a most satisfactory 
manner. The chapter on electromagnetic interactions treats 
a familiar subject in a most unfamiliar manner. The 
chapter on nuclear reactions is a faithful account of the 
Wigner-Eisenbud theory. 


JULIAN K. KNIPP
Iowa State College 


Experimental Nuclear Physics. Vol. II. E. SEGRÈ. Pp. 569
+viii, Figs. 117, John Wiley and Sons, Inc., New 
York, 1953. Price $12.00. 


This is the second in a series of three volumes edited by 
E. Segrè, which covers the field of nuclear physics. The
second volume contains two separate and independent 
parts; the first is a Survey of Nuclear Reactions by P. 
Morrison, and the second part concerns The Neutron and 
is written by B. T. Feld. The first part of the volume is a 
theoretical discussion of nuclear reactions and presents the 
theory in a lucid fashion; selected experimental results 
are given as examples which are compared with the theory. 
It is not a compilation of all the nuclear reactions which 
have been studied. In some places experimental data are 
not given where their appearance would have been de-
sirable. For example, a “schematized” gamma-ray yield
curve is given (p. 95) where an experimental curve would 
have been much better. In the theoretical discussion of 
stripping it is unfortunate that only one paragraph is 
devoted to Butler’s theory of stripping which has been so 
successful a tool in determining the total angular mo- 
mentum of excited states of light nuclei. There are a few 
minor errors in the text which may detract from its use by 
unwary students. For example, the curves showing barrier 
heights for protons (Fig. 12c) and alpha particles (Fig. 
13c) give barrier heights which are twice as large for 
protons as for alpha particles. 
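
Why the reviewer counts those figures as an error can be seen from a rough estimate. The sketch below assumes the simple touching-spheres Coulomb barrier, B = Z₁Z₂e²/(4πε₀R) with R = r₀(A₁^⅓ + A₂^⅓), and takes r₀ = 1.4 fm and a copper-like target (Z = 29, A = 64). The formula, the parameter values, and the function name are illustrative assumptions and are not taken from the volume under review.

```python
E2_MEV_FM = 1.44   # e^2/(4 pi eps0) in MeV fm (rounded)


def coulomb_barrier_mev(z_proj, a_proj, z_target, a_target, r0_fm=1.4):
    """Rough Coulomb barrier height (MeV) for touching spheres."""
    radius_fm = r0_fm * (a_proj ** (1.0 / 3.0) + a_target ** (1.0 / 3.0))
    return E2_MEV_FM * z_proj * z_target / radius_fm


# Assumed target: Z = 29, A = 64.
proton = coulomb_barrier_mev(1, 1, 29, 64)
alpha = coulomb_barrier_mev(2, 4, 29, 64)
print("proton barrier (MeV):", proton)
print("alpha barrier (MeV):", alpha)
print("ratio alpha/proton:", alpha / proton)
```

On this estimate the doubled charge of the alpha particle makes its barrier nearly twice that of the proton, the reverse of what the criticized curves show.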

The second half of the volume is an excellent monograph 
on neutron physics, which combines a comprehensive 
theoretical treatment of the subject with a review of the 
latest experimental results. It begins with a brief historical 
summary of the discovery of the neutron and continues 
with its properties and fundamental interactions. There are 
extensive sections on neutron interactions with nuclei and 
neutron sources and detectors. There are briefer treat- 
ments of neutron diffusion and elementary pile theory. 
The treatise ends with a section on Coherent Scattering 
Phenomena with Slow Neutrons; this includes neutron 
diffraction, nuclear scattering amplitudes, and magnetic 
scattering. 

The technical make up of the book is good, although 
the reviewer dislikes having most of the references placed 
at the back of the book instead of at the bottom of each 
page. In conclusion, the volume should be a very useful 
one to both advanced students and research workers in 
the field of nuclear physics. 

T. W. BONNER
The Rice Institute, Texas 







Radioactivity and Radioactive Substances. Sir JAMES 
CHADWICK. Fourth edition. Revised and supplemented 
by J. ROTBLAT. Pp. 120, Plates 4, Figs. 41. Pitman
Publishing Corporation, New York, 1953. Price $3.00. 


As may be surmised this small book is identical in many 
ways with the third edition of 1947. However, numerous 
minor changes have been made to bring the subject matter 
within the accuracy of current data and within the view- 
point of present theory. The radioactive substances dealt 
with are nearly exclusively those of the natural radioactive 
series. One does not obtain even a suspicion that besides 
these 47 natural radioactive isotopes there are over 1000 
new radioactive isotopes that also have peculiar and in- 
teresting modes of decay. Electron capture is not men- 
tioned. The book is clearly one that describes the research 
interests and conclusions arrived at years ago. If it were to
do otherwise the carryover of the title of the book would 
not have been justified. 

There is a new chapter added to those in the older 
edition; however, this last chapter of the book is on nuclear
structure and the theory of transformation. The subject 
is very briefly and clearly presented in the most elementary 
manner. For example, two pages are devoted to artificial 
radioactivity and nearly two to nuclear fission. 

If one wishes to obtain quickly a very succinct picture of 
the main features of natural radioactivity and the ele- 
mentary tools used some years ago, then this book is highly 
recommended. The book may be considered as desirable 
collateral reading of a few hours at the beginning of an 
introductory course in nuclear physics. The book is much 
too short and too much of a survey type to be used as a 
text. This fourth edition, as well as the others, is an abbre- 
viated readable version of the classic Radiations and Radio- 
active Substances by Rutherford, Chadwick, and Ellis. 

M. L. POOL
Ohio State University 


Practical Aids for Teachers of Physics 


Current Drawn in Starting an Automobile 
Engine 


Students have often expressed considerable interest in 
measuring the current drawn from a battery in starting the 
engine of an automobile. The procedure described here 
can be carried out by the average student in the elementary 
course. 

The battery ground strap is removed, and two wires, 
say, six inches apart, are soldered to it. The resistance of 
the strap between these two wires is measured with a 
Kelvin bridge, the ends of the strap being connected to the 
current leads and the wires to the potential leads of the 
instrument. The strap is replaced in the car, with a volt- 
meter attached to the two wires, and the engine started. 
The momentary deflection of the meter’s needle can be 
read without difficulty. Typical values obtained from a 
student’s car, prewar model, were resistance of strap, 
1.1 × 10⁻⁴ ohm; potential difference, 0.020 volt; starting 
current, approximately 180 amp. (Contributed by GEORGE 
B. WELCH, Northeastern University, Boston 15, Massachusetts.) 
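
The arithmetic behind the quoted starting current is Ohm's law applied to the measured section of the ground strap. A minimal sketch in Python, using the typical values given in the note; the function name is ours and purely illustrative:

```python
def starting_current(strap_resistance_ohm, potential_difference_volt):
    """Ohm's law, I = V / R, for the measured section of the strap."""
    return potential_difference_volt / strap_resistance_ohm


# Typical values quoted in the note: R = 1.1e-4 ohm, V = 0.020 volt.
current = starting_current(1.1e-4, 0.020)
print(f"Starting current: about {current:.0f} amp")
```

The result, about 182 amp, agrees with the "approximately 180 amp" quoted, given the two-figure precision of the readings.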


The Weighted Meter Stick 


In most elementary physics laboratories there is an ex- 
perimental study of parallel forces, usually applied to a 
meter stick. The experiment usually consists of a study of 
moments, and a determination of the center of gravity and 
weight of the meter stick. 

In our general laboratory we have for some years used 
a set of “loaded” meter sticks. These are regular maple 
meter sticks with holes bored in one end and filled with 
lead. This can easily be done either by pouring melted lead 
into the holes or by pounding lead shot into them. In the 
first case care must be taken that the lead is not so hot 
that it burns the wood, and in the second case that the wood 
is not split. We have filled our sticks by boring in from the end 
so that the loading will not be obvious. Enough lead can 
easily be added to shift the center of gravity by 10 cm 
or more. 

We usually have the students determine the center of 
gravity by hanging the stick in various positions, then 
checking it by the method of torques. 
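
One way the check by the method of torques might be carried out is sketched below in Python. The note does not specify the procedure, so the two-balance scheme, the function name, and the numbers are assumptions for illustration only. The stick is balanced on a knife edge at two different pivot positions, each time with a known mass hung where it gives balance, and the two torque equations are solved for the mass of the stick and the position of its center of gravity.

```python
def stick_mass_and_cg(pivot1, hang_mass1, hang_pos1,
                      pivot2, hang_mass2, hang_pos2):
    """Solve M*(c - p) = m*(p - x) for two balance conditions.

    Positions are scale readings in cm, masses in grams.
    Returns (M, c): stick mass and center-of-gravity position.
    """
    if pivot1 == pivot2:
        raise ValueError("the two pivot positions must differ")
    # Torque of each hung mass about its pivot (gram-centimeters).
    torque1 = hang_mass1 * (pivot1 - hang_pos1)
    torque2 = hang_mass2 * (pivot2 - hang_pos2)
    # Subtracting the two balance equations eliminates M*c.
    stick_mass = (torque1 - torque2) / (pivot2 - pivot1)
    cg = pivot1 + torque1 / stick_mass
    return stick_mass, cg


# Hypothetical readings: a 100-g mass balances the stick when hung at
# 38.0 cm (pivot at 50.0 cm) and at 50.5 cm (pivot at 55.0 cm).
mass, cg = stick_mass_and_cg(50.0, 100.0, 38.0, 55.0, 100.0, 50.5)
print(f"stick mass = {mass:.1f} g, center of gravity at {cg:.1f} cm")
```

With these made-up readings the stick mass comes out as 150 g and the center of gravity at 58 cm, which the student can compare with the value found by hanging the stick.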


The following advantages have been found in using the 
loaded meter stick. 


(1) The students usually do not know that the stick 
“is loaded,” since the loading is done neatly in one end, and are 
mildly surprised to discover that the center of gravity is 
not at the geometric center. “Naturally” they look for the 
cause. The element of surprise adds interest. It is a good 
chance for the instructor to mention loaded dice, unreliable 
balances, etc. 

(2) Modern production methods turn out such a uniform 
product that there is little possibility that the center of 
gravity of an unaltered stick would be far from the geometric 
center. The students accordingly anticipate this and 
usually end up by averaging four or five determinations of 
50.0 cm. Also it seems a little trite to “find” something 
that is already known. Each of the sticks that we use has 
an identification number and a different center of 
gravity. 

(3) From a teaching point of view, the lack of symmetry 
has real merit. If the stick is pivoted at the geometric 
center, the weight of the stick does enter into torque 
problems. Another possibility is to compute the amount of 
lead in the end from the shift in the center of gravity. 
This can be treated as a simple problem in composite 
figures. 

(4) The weighted stick is the logical introduction to beam 
balances, weighing devices, levers, etc., most of which are 
not symmetrical. 

(5) A half-meter stick can be weighted just right so that 
it will float upright in water, making an excellent im- 
provised lecture table hydrometer. Hydrometer problems 
are very instructive and are less difficult if the hydrometer 
has a uniform cross section. This introduces the ideas of 
center of buoyancy and metacenter. 







(6) The present student generation that seems so poorly 
prepared in vocabulary as well as in mathematics can well 
relearn the ideas of “carrying more weight,” “weighted” 
averages, etc., and that some things are of “more moment” 
than others. 

(7) The change to loaded meter sticks costs only a little 
time, and it adds much. (Contributed by GRANT O. GALE, 
Grinnell College, Grinnell, Iowa.) 
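
The composite-figure problem suggested in item (3) can be sketched as follows in Python. The treatment is an assumption on our part, not the author's stated method: the unloaded stick is taken as uniform with its center of gravity at 50 cm, and the loading is treated as a net added mass (lead minus the wood bored out) concentrated at the center of the plug. The function name and the numbers are hypothetical.

```python
def net_added_mass(total_mass, cg, plug_pos, uniform_cg=50.0):
    """Net mass added by loading, from the shift in center of gravity.

    Taking moments about the zero end of the stick:
        total_mass * cg = (total_mass - dm) * uniform_cg + dm * plug_pos
    which gives dm = total_mass * (cg - uniform_cg) / (plug_pos - uniform_cg).
    Masses in grams, positions in cm.
    """
    if plug_pos == uniform_cg:
        raise ValueError("a plug at the center produces no shift")
    return total_mass * (cg - uniform_cg) / (plug_pos - uniform_cg)


# Hypothetical stick: 150 g in all, center of gravity found at 58.0 cm,
# lead plug centered at the 95-cm mark.
dm = net_added_mass(150.0, 58.0, 95.0)
print(f"net added mass = {dm:.1f} g")
```

For these made-up figures the net added mass is about 26.7 g. Estimating the lead alone would further require the mass of the wood removed.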


An Inexpensive Shockmount 


An effective shockmount may be devised from a short 
length of garden hose. A part of the cylinder is removed by 
cutting parallel to or along a diameter for some distance 
along the axis of the cylinder. A lateral cut removes the 
unwanted portion of the piece of hose, leaving an “ear” 
on the end of the cylinder. Similar cuts are made on the 
other end of the cylinder. This leaves two “ears” displaced 
180 degrees with respect to each other. One “ear” is 
attached to the supporting member and the other to the 
article to be mounted. 

A useful variation is obtained by using a “one-eared” 
cylinder, i.e., cutting away only one part of the cylinder. 
A slit in the uncut portion of the cylinder will hold a 
machine screw very nicely. The shockmounted article is, 
of course, attached to the machine screw. Such an arrange- 
ment permits easy removal or dismantling of the shock- 
mounted device. 

By varying the sizes of the cuts, the shock resistance can 
be adjusted over a considerable range. I have used this 
arrangement to shockmount chassis weighing up to six 
pounds. For a chassis weighing two pounds, four pieces of 
garden hose, each about two inches long, were found to be 
very satisfactory. For very light articles, gas or water 
tubing should be substituted for garden hose. 

Incidentally, an analysis of the allowed modes of vibra- 
tion for this configuration yields interesting results. (Con- 
tributed by WARREN J. DESHOTELS, International Testing 
Service, 321 N. Hamilton Street, Saginaw, Michigan.) 


New Members of the Association 


The following persons have been made members or junior members 
(J) of the American Association of Physics Teachers since the publica- 
tion of the preceding list [Am. J. Phys. 22, 244 (1954) ]. 

Arlotto, Joseph J., 145 Pleasant Street, Utica, N. Y. 
Bershader, Daniel, Palmer Physical Laboratory, Princeton, 
N. J. 

Bowyer, Allen Frank (J), 208 Newton Hall, Angwin, Calif. 
Buford, Phillip N., 3417 Tuland Drive, West Hyattsville, 
Md. 


Chen, Joseph Hsiu (J), St. Procopius College, Lisle, Ill. 




Clemont, Willis Hawkins, St. Paul’s Polytechnic Institute, 
Lawrenceville, Va. 

Clendenen, Clifford L. (J), 204 Newton Hall, Angwin, 
Calif. 

Cole, Edgar Marion, 1821 Savannah Street, S.E. No. 304, 
Washington, D. C. 

Crawford, John Eric (J), 4 Norma Crescent, Toronto, 
Ontario, Canada. 

Daly, William Anthony (J), 1505 Glenwood Drive, San 
Diego, Calif. 

Davis, Edward Alex (J), 2619 Thelma, Houston, Texas. 

Ellis, George Abdo (J), 505 Mary Street, Utica, N. Y. 

French, Walter Russell, Jr., Nebraska Wesleyan Univ., 
Lincoln 4, Neb. 

Green, Thomas Myer (J), Apt. D, 4 Webster Avenue, 
Hanover, N. H. 

Hamilt, Arthur (J), 54 Nahant Avenue, Revere, Mass. 

Hazdra, James Joseph (J), 2316 S. 59th Court, Cicero 50, 
Ill. 

Hendershot, Otis P., The Citadel, Apt. 10C, Charleston, 
S. C. 

Huebner, Albert (J), 743 Bradford Street, Brooklyn 7, 
N.Y. 

Jachlewski, Richard Henry (J), 48 Sachem, Hanover, 
N. H. 

Jacobs, Stephen Frank (J), Dept. of Physics, Johns Hop- 
kins University, Baltimore, Md. 

Jaffe, Sherman (J), 259-19 148 Road, Rosedale, Long 
Island, N. Y. 

Johnson, Benjamin Richard, The Citadel, Charleston, 
S.C. 

Jordan, Edward Daniel (J), 427 Shelton Street, Bridge- 
port 8, Conn. 

Levine, Nathan (J), 507 E. Green Street, Champaign, Ill. 

Long, Vernon Lovell, 1445 Bertha-Beaverton Highway, 
Beaverton, Ore. 

Manning, Irwin (J), M.I.T. Graduate House 405B, 
Cambridge 39, Mass. 

Maurer, Robert Joseph, 509 W. Iowa, Urbana, Ill. 

Mracek, Olga Mary (J), 973 Bloor St. W, Toronto 4, 
Ontario, Canada. 

Powers, Glenn F., 2203 Ada Street, Conway, Ark. 

Ross, John Stoner, 1311 Pennsylvania Avenue, Winter 
Park, Fla. 

Rouse, Carl Albert (J), 1541 Lombardy Road, Pasadena 4, 
California. 

Rutledge, Delbert Leroy, 222 East 9th, Edmond, Okla. 

Segerman, Ephraim, c/o Knapp, 644 E. 5th St., New 
York, N. Y. 

Taylor, Jeremy Phil (J), 2394-3 Patterson, Eugene, Ore. 

Tinker, Evart A. (J), Rm. 302 Newton Hall, Angwin, 
Calif. 

Wong, P. K. (J), P.O. Box 164, Angwin, Calif. 

Woods, Roy A., 921 Craten Road, Norfolk 13, Va. 





New texts from Macmillan 


The Theory of the Photographic Process 


second edition 


by C. E. Kenneth Mees 


Here is an up-to-date revision of a well-known book dealing 
with the composition and properties of photographic light- 
sensitive materials; the factors which control their sensitivity 
to light; the changes induced in them by the action of light; 
the nature of the process of development; the properties of the 
final image; and the measurement of its tone values. Three 
new chapters include: The Action of Charged Particles on The 
Photographic Emulsion, The Latent Image Produced by X-Rays, 
and The Sensitometry of Color Films and Papers. Complete 
bibliographies accompany each chapter. Ready June 1954 


The Determination of Crystal Structures 
Volume III of The Crystalline State edited by Sir Lawrence Bragg 


by H. Lipson and W. Cochran 


Here is a complete coverage of the subject of crystal-structure 
determination from the stage at which a set of structure 
amplitudes has been obtained to the final accurate positioning 
of the atoms. The routine steps such as the calculation of 
structure factors and the summation of Fourier series are 
described first, and the last five chapters discuss the more 
difficult problems of the derivation of atomic positions with 
some degree of logical order. However, the authors emphasize 
throughout that each problem requires individual treatment, 
and that no single sequence of methods is likely to prove 
universally successful. 

Published February 1954, 345 pp. $8.00 


The Macmillan 


60 FIFTH AVENUE, NEW YORK 11 


AMERICAN JOURNAL OF PHYSICS MAY, 1954 





CONTENTS 


AMERICAN JOURNAL OF PHYSICS VOLUME 22, NUMBER 5 MAY 1954 


Reproductions of Prints, Drawings, and Paintings of Interest in the History of Physics. 61. 
Caricatures of Sir Isaac Newton by Two Famous Artists 


Effective Mass of Electrons in Crystals C. Kittel 
Note on the Study of Uniformly Accelerated Motion Henry S. C. Chen 
Valence Bond Calculations L. A. Schmid 
Techniques and Applications of Neutron Diffraction M. K. Wilkinson 
Photonuclear Reactions Arthur I. Berman 
Systems of Units in Mechanics—A Summary M. Loren Bullock 
Displacement Current, A Useless Concept F. W. Warburton 
Experiments on the Laws of Light Absorption Otto Blüh and Howard Ko 
Physics of the Glassy State. IV. Radiation-Sensitive Glasses E. U. Condon 
On the Theory of the Van de Graaff Electrostatic Generator 
Elementary Derivation of Equations for Wave Speeds Austin J. O’Leary 
Interpretation of the Variable Mass in Wessel’s Theory of the Electron S. J. Czyzak 
Notes and Discussion: 

Note on the Utilization of Radiant Energy from the Sun in Rockets 

Large-Screen Oscilloscope 
Letters to the Editor: 

Multiple Images in Plane Mirrors Julius Sumner Miller 

Peculiar Properties of a Glass Rod Julius Sumner Miller 

Electrical Method for Solving the Lens Equation Donald J. Lovell 

On the Linearization of a Relativistic Hamiltonian S. Y. Tiwari and B. Bhattacharya 

Irving L. Kofsky and Kenneth W. Sherk 

Undergraduate Physics Laboratory Vernon Crawford 
Recent Meetings: 

Minnesota Area Section 
Announcements and News: 

Symposium on Molecular Structure and Spectroscopy 

Iowa Colloquium of College Physicists 

June Meeting of the AAPT 

Errata: Nuclear Quadrupole Resonance 

Motion Pictures for Classroom Use 

Western Michigan College and Kalamazoo College 

Professor O. H. Blackwood 


Practical Aids for Teachers of Physics 
New Members of the Association 







